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Foreword 


Yvonne Choquet-Bruhat has made many deep and lasting contributions 
to mathematical and physical aspects of General Relativity, starting 
with her epoch-making 1952 proof of the well-posedness of the Cauchy 
problem for Einstein’s equations. We are all very fortunate that she 
has undertaken to present, in terms accessible to all, a comprehensive 
account of all the aspects of General Relativity. Indeed, this beautiful 
book is quite unique both in the range of topics it covers and in the way 
each topic is treated. 

First, the range of topics presented by Yvonne Choquet-Bruhat is 
truly remarkable. She covers successively the basics of Riemannian 
geometry (Chapter I) and of Special Relativity (Chapter IT); the math- 
ematical and physical definition of General Relativity (Chapter III); 
the main properties and consequences of Einstein’s field equations 
(Chapter IV); the mathematics and physics of the Schwarzschild 
spacetime (Chapter V); a mathematically rigourous discussion of black 
holes (Chapter VI); a brief, but up to date, discussion of relativistic 
cosmology (Chapter VII); a thorough presentation of the Cauchy 
problem (Chapter VIII); and last, but not least, a detailed discussion 
of two of the most important phenomenological matter models, namely 
relativistic fluids (Chapter IX) and the relativistic kinetic theory of 
N-particle systems (Chapter X). 

But, most importantly, the treatment of each one of these broad topics 
is both very comprehensive and remarkably concise. She has succeeded in 
reaching a Landau- and Lifshitz-like ideal of covering all the crucial issues 
in the most concise way, while expounding each topic in a mathematic- 
ally rigourous way. This rare combination of qualities makes this book 
particularly valuable. For instance, her discussion of the Schwarzschild 
spacetime covers, in one go: (i) a derivation of the solution that includes 
a proof of Birkhoff’s theorem; (ii) the form of the solution in five different 
types of coordinates; (iii) a preliminary discussion of the event horizon 
(which is developed in the following chapter); (iv) the motion of planets 
and of light, and their comparison with the most recent observations; (v) 
the stability of circular orbits; (vi) a presentation of Fermat’s principle 
that includes its little-known generalization to arbitrary spacetimes due 
to G. Ferrarese; (vii) the redshift and time-delay effects; (viii) a dis- 
cussion of spherically symmetric interior solutions that includes several 
theorems about their general properties; (ix) the Reissner-Nordstrém so- 
lution; (x) the Schwarzschild solution in any dimension; and (xi) a precise 
but concise account of the results of Gu Chao Hao, Christodoulou, and 
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others on spherically symmetric gravitational collapse. The chapter is 
then nicely capped by some problems, followed by their solutions. 

I am sure that this remarkably concise and complete book by Yvonne 
Choquet-Bruhat will allow many readers to achieve a deep understand- 
ing of General Relativity through her unique mathematico-physical 
knowledge of one of the fundamental pillars of modern physics. Thank 
you, Yvonne for sharing with us the deep wisdom you have acquired 
during your lifelong exploration of the beautiful theoretical landscape 
opened, nearly a century ago, by Albert Einstein. 


THIBAULT DAMOUR 
Institut des Hautes Etudes Scientifiques 


Preface 


Special Relativity (1905) and General Relativity (1915), both due to 
Einstein’s genius, are fundamental pillars of modern physics. They have 
revolutionized the scientific concepts of space and time, formerly due 
to everyday experience at a human scale, and also to previous scien- 
tific work and experiments made with clocks, very precise and reliable 
at the considered scale. These concepts of absolute space and absolute 
time had become ingrained in the minds of common folk as well as 
scientists, and were difficult to overthrow for a long time.! Nowadays, 
the world explored by humanity, at microscopic as well as cosmological 
scales, has become very much greater, and old concepts have had to be 
replaced by more general ones. Spectacular improvements in technology 
have changed the lives of a large part of humanity, and new information 
processes have permitted easier access to scientific knowledge and the 
acceptance of new concepts. 

Modern physical theories have a mathematical formulation, often 
geometrical, with consequences deduced from mathematical theorems. 
The validity of a physical theory results from the verification of its 
consequences by observations or experiments. 

General Relativity is a beautiful geometric theory, simple in its gen- 
eral mathematical formulation, which has numerous consequences with 
striking physical interpretations: gravitational waves, black holes, cosmo- 
logical models. Several of these consequences have been verified with a 
great accuracy. The Einstein equations present a wide variety of new and 
interesting mathematical problems with possible physical interpretation. 

The aim of this book is to present with precision, but as simply as pos- 
sible, the foundations and main consequences of General Relativity. It 
is written for an audience of mathematics students interested in physics 
and physics students interested in exact mathematical formulations— 
or indeed for anyone with a scientific mind and curious to know more of 
the world we live in. The mathematical level of the first seven chapters 
is that of undergraduates specializing in mathematics or physics; these 
chapters could be the basis for a course on General Relativity. The next 
three chapters are more advanced, though not requiring very sophisti- 
cated mathematics; they are aimed at graduate students, lecturers, and 
researchers. No a priori specialized physics knowledge is required. These 
chapters could serve as the text for a course for graduate students. 

I wish to everyone a good trip in this strange but fascinating world. 


YVONNE CHOQUET-BRUHAT 


lFor instance, the fact that two twins 
who live different lives age differently 
was called the ‘twin paradox’, although 
in fact there is no paradox there, except 
in the human-built definition of time. 
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Notation 


The sign = denotes an identity and the sign := a definition, although 
sometimes when there is no confusion possible I denote by the simple 
sign = one or the other of these. 

In the text, I usually denote by a single letter a geometric ob- 
ject, for example X or T for a vector or tensor. In equations, I 
prefer to use indices, since this makes operations simpler to write 
and understand—without speaking of possible numerical or algebraic 
computing applications. 

The spacetime metric signature is — ++ . . .+. In the Lorentzian case, 
the spacetime indices are Greek letter. When specified, a time index is 
denoted by zero, and the space indices by Latin letters from the middle 
of the alphabet, i,7,.... 

Definitions of new notions are in bold characters and are assembled 
in the index. 


Note on references 


I have not tried to find for each result an original source. This would 
have been a difficult and sometimes controversial task. In the side notes, 
I give references, assembled at the end of the book, to papers I have used, 
where the interested reader can find relevant details, complements, and 
references to previous papers about the subjects treated. 


e The reference YCB-OUP2009 is to Yvonne Choquet-Bruhat General 
Relativity and the Einstein Equations, published by Oxford University 
Press in 2009. 

e The references CB-DMI and II are to Y. Choquet-Bruhat and 
C. DeWitt-Morette, Analysis, Manifolds and Physics, Parts I and II, 
published by North-Holland in 1982 and 2000, respectively. 
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Part A 
Fundamentals 


This part contains the mathematical definitions and physical inter- 
pretations necessary for a basic understanding of Einstein’s General 
Relativity. The basic definitions and properties of Riemannian and 
Lorentzian geometry are given. Special Relativity is introduced, with 
discussion of proper time, the equivalence of mass and energy, and phys- 
ical observations. The spacetimes of General Relativity are defined, and 
their basic mathematical properties and physical interpretations are de- 
scribed, along with comments on experimental results. The theoretical 
foundations of the Einstein equations are given and these equations are 
derived. The Newtonian and Minkowskian approximations of their so- 
lutions are described. Gravitational waves and gravitational radiation 
are introduced. The spherically symmetric Schwarzschild spacetime is 
deduced from the Einstein equations in vacuum, its mathematical conse- 
quences are computed, and their observational confirmation is described. 
Black holes, a phenomenon unknown to Newtonian mechanics, but 
predicted by the Einstein equations, are discussed. Both spherically 
symmetric and axisymmetric (Kerr) black holes are constructed and 
studied. Some mathematical results and conjectures are introduced, and 
an introduction to the thermodynamics of black holes and Hawking ra- 
diation is provided. Relevant astrophysical observations are described. 
The construction of the first cosmological models—Einstein, de Sitter, 
and anti-de Sitter—is given. General Robertson—Walker spacetimes, the 
cosmological redshift, and the Hubble law are discussed. Friedman- 
Lemaitre universes and the presently accepted ACDM model with 
accelerated expansion are described, as is the current majority view on 
the content of the universe, including the mysterious dark matter and 
dark energy. A brief account of primordial cosmology is given. 


Riemannian and 
Lorentzian geometry 


I.1 Introduction 


We give in this chapter a survey of the basic definitions of Riemann- 
ian and Lorentzian differential geometry! that are necessary in General 
Relativity. 


I.2 Differentiable manifolds and mappings 


1.2.1 Differentiable manifolds 


The fundamental arena of differential geometry is a differentiable mani- 
fold. For the physicist, the most concrete and useful definition makes 
apparent the local identification of a manifold with R”, the space of sets 
of n real numbers with its usual topology.” The definition proceeds as 
follows. 

A chart on a set X is a pair (U,¢), with U a subset of X, called 
the domain of the chart, and ¢ a bijection from U onto an open set u 
of R”, i.e. a one-to-one invertible mapping ¢: U > u by z > ¢(x) = 
(x!,...,2"). The numbers x’, i = 1,...,n, are called local coordinates 
of the point x € X. 

An atlas on X is a collection of charts (Ur, dr), with {J} an arbitrary 
set of indices, whose domains cover X. 

A mapping f between open sets of R”, f : u— v, is called a homeo- 
morphism if it is bijective and if f and its inverse mapping f~! are 
continuous. A bijective mapping is a C! diffeomorphism if f and f~! 
are of class C*. 


Exercise I.2.1 Prove that a C! diffeomorphism with f of class C* is 
a C* diffeomorphism. 


Hint: 0(f f~')/dz* = 0. 


An atlas endows X with the structure of a topological manifold, of 
dimension n, if the mappings @; o os" are homeomorphisms between 
the open sets of R”, é;(U; N Ur) and (Uz N Ur). The mappings 
pro 7t define changes of local coordinates in the intersection Uz N U7, 
(x1,...,£”) > (yt,...,y”). The manifold is a differentiable mani- 
fold of class C* if these mappings are C* diffeomorphisms; that is, if 


I.1 Introduction 3 
I.2 Differentiable manifolds 
and mappings 3 


I.3 Tensors and tensor fields 7 


I.4 Structure coefficients 


of moving frames 10 
I.5 Pseudo-Riemannian 
metrics 11 
I.6 Causality 14 
I.7 Connections 16 
I.8 Geodesics, another 
definition 19 
I.9 Curvature 20 
I.10 Geodesic deviation 22 
I.11 Linearized Ricci tensor 23 
1.12 Physical comment 24 
I.13 Solutions of selected 
exercises 24 
I.14 Problems 28 


1More complete treatments can be 
found in many textbooks, in particular 
in CB-DMI. 


2 A basis of open sets in R comprises the 
sets determined by the order relation 
a < x < b. The topology of R” is the 
direct product of n copies of R. 


4 Riemannian and Lorentzian geometry 


the functions y’(a!,...,2") are of class C} and the Jacobian determin- 
ant D(y)/D(x) with elements the partial derivatives Oy'/Ox) is different 
from zero. Henceforth, by smooth we shall mean of class C} with k large 
enough for the statement under consideration to be true; a particular 
case is k = oo. 

Two O* atlases on X are said to be equivalent if their union is again 
a C® atlas. We consider them to define the same C* manifold. It is 
known that any C! manifold can be endowed with a C% structure. 
Unless otherwise stated, all manifolds considered here are assumed to 
be smooth. 

The given definition of a manifold does not imply that it is a Hausdorff 
topological space; that is, it admits the possibility for two points not to 
have non-intersecting neighbourhoods. In the following, we will, unless 
otherwise stated, only consider Hausdorff manifolds and call them 
simply manifolds. 

Examples of manifolds are the sphere and the torus. 

An open set of R” is oriented by the order of the coordinates 
(z',...,a”"). A differentiable manifold is said to be orientable (and 
oriented by the choice of coordinates) if its defining atlas is such that 
D(y)/ D(x) > 0 in all intersections of domains of charts. 

Unless otherwise stated, the manifolds considered here will be C™, 
connected and oriented. 


1.2.2 Differentiable mappings 


A function f on an n-dimensional manifold V” is a mapping V” — R 
by «+ f(a). Its representative in local coordinates of the chart (U, ¢) 
is a function on an open set of R”, fg := fog !: (xt) f(d7'(a")). 

The function f is of class C} at x if fy is of class C* at ¢~'(x). This 
definition is chart-independent if V” is smooth. The gradient, also called 
differential, of f is represented in a chart by the partial derivatives of 
fy. If (U, ġ) and (U’, ¢’) are two charts containing x, it holds that at x 
(calculus relations) 


afe  Əfy Oat 
Oxt  ƏðxrÜ Əri” 


(1.2.1) 


This equivalence relation entitles the differential of f to be called a 
covariant vector (see below). A covariant vector is a geometric object, 
independent of a particular of coordinates choice. 

A differentiable mapping f between differentiable manifolds and 
diffeomorphisms can be defined analogously. The differential of f at 
xE W”, the source, is represented in a chart around x and a chart at 
f(x) € VP, the target, by a linear mapping from R” into R?; that is, an 
n x p matrix with elements 8 f®/ðxt, a=1,...,p,i=1,...,n. 


I.2 Differentiable manifolds and mappings 


1.2.3. Submanifolds 


An embedded submanifold of V is the image by an injective 
differentiable mapping of a smooth manifold W. 


1.2.4 Tangent and cotangent spaces 


A tangent vector v to a differentiable manifold V at a point x € V 
is a geometric object. It can be defined through local coordinates as 
an equivalence class of triplets (Ur, ¢7,vg,), where (Uzr, r) are charts 
containing x, while vg, = (v4) i = 1,...,n, are vectors in R”. The 
equivalence relation is given by 


a 

i= vi OFT, (1.2.2) 
Ox 

where x and xf, are respectively local coordinates in the charts (Uz, ¢7) 
and (U;,@7). The vector vg E€ R” is the representative of the vector 
v in the chart (U,¢). The vector v is attached to the manifold by the 
assumption, compatible with the equivalence relation, that the numbers 
vs are the components of vg in the frame of R” defined by the tangents 
to the coordinate curves, curves in R” where only one coordinate varies. 

Tangent vectors at x constitute a vector space, the tangent space 
to V” at x, which is denoted T,V”. The set of pairs (x, 0z), x € V”, 
vz E€ T,V", denoted by TV”, is called the tangent bundle to V”. 

An arbitary set of n linearly independent tangent vectors at x con- 
stitute a frame at x. The natural frame associated to a chart (U, ¢) 
is the set of n vectors eq), i = 1,...,n, such that Cli) = ô}; they are 
represented in local coordinates by tangent vectors to the coordinates 
curves of the chart. The numbers vh, are the components of the vector 
v in the natural frame. Traditionally, indices of components of vectors 
are written upstairs. 

The cotangent space T* to V is the dual of Ty, that is, the space 
of l-forms on Ty, which are also called covariant vectors at x € V, 
the tangent vectors being called contravariant. The components of a 
covariant vector at x in a chart (U, ¢) containing z, are a set of n numbers 
vi, i = 1,...,n, with indices traditionally written downstairs. Under a 
change of chart from (U, ¢) to (U, ¢’), it holds that 


ðxİ 
/ = . 
vi = Uj aah’ (1.2.3) 
Covariant vectors can be defined by this equivalence relation. By (2.1), 
the differential at x of a differentiable function f is a covariant vector, 
called the gradient of f. The natural coframe is the set of differentials 
dx' of the coordinate functions x + zq’. 
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3Even linear, that is also such that 
v(Af) = Av(f), with A a constant in 
R, a property included in the Leibniz 
rule. 


4See, for instance, CB-DMI, II B 1. 


5It has been proved that all orientable 
differentiable 3-manifolds are paral- 
lelizable; this is not true in all other 
dimensions. 


I.2.5 Vector fields and 1-forms 


A vector field on V assigns a tangent vector at x € V to each point x. 

The relations (2.1) and (2.2) show that, given a differentiable function 
f on V, the quantity v( f) defined for points x in the domain U of the 
chart (U,@) by 


LL fy foo (1.2.4) 


is chart-independent: v defines a mapping between differentiable func- 
tions. It is easy to check that this mapping is additive,’ 


u(f +9) = v(f) + (9), (1.2.5) 
and satisfies the Leibniz rule, 
u(fg) = fulg) + gv( F). (1.2.6) 


The properties (2.5) and (2.6) characterize a derivation operator, 
and v(f) is called the derivative of f along v. 
If we take for v a vector of a natural frame eg (i), that is, vi = 6, 
then 
alf og") 


u(f) = ep, (f) = ai (1.2.7) 


Alternatively, a tangent vector at a point x on a differentiable mani- 
fold can be defined, without first introducing its representatives and the 
equivalence relation, as the value at x of a linear first-order derivation 
operator acting on differentiable functions defined in a neighbourhood 
of x.t The natural frame can thus be defined as the set of operators 
ð/ðxrİ, i = Ty, wage 

A covariant vector field on V assigns a covariant vector at x to each 
point x € V. It is called a 1-form on V. 


1.2.6 Moving frames 


A moving frame, often called simply a frame in what follows, in a 
subset U of a differentiable n-dimensional manifold V is a set of n vector 
fields on U that are linearly independent in the tangent space TV at 
each point x € U. A manifold that admits global frames (not necessarily 
global coordinates) is called parallelizable.° 

A coframe on U is a set of n 1-forms 6* that are linearly independent at 
each x € U in the dual space TV. In the domain U of a coordinate chart, 
a coframe is specified by n linearly independent differential 1-forms 


0 = aide’, (1.2.8) 


with a$ functions on U. 


The differential of a differentiable function f is a covariant vector field 
called an exact 1-form 


Tga a; FO". 


df := gg 


Its components 0, f in the coframe 6 are called Pfaff derivatives. 
The dual frame to a coframe 6” is the set of vector fields such that 


O° (ej) = 5. (1.2.9) 


The natural frame 0/0z‘ and the natural coframe dz’ are dual. 


1.3 Tensors and tensor fields 


1.3.1 Tensors, products and contraction 


A covariant p-tensor at a point x € V can be defined as a p-multilinear 
form on p direct products of the tangent space TV with itself. Con- 
travariant tensors can be defined as multilinear forms on direct products 
of the cotangent space TV. 

Tensors, contravariant or covariant, can also be defined, like vec- 
tors, through equivalence relations between their components in various 
charts. For instance a 1-contravariant 2-covariant tensor T at x € V is 
an equivalence class of triplets (Ur, ¢1,T¢,,:;"), i,j, = 1,...,n, with 
the equivalence relation being the law of change of components of T by 
change of local coordinates from (U, ¢) to (U,¢’) given by 


, Ôx? Aa! dah 


= aoa a lkl 
Ox" OxI' Ox™ 


The space of tensors of a given type is a vector space, in which summa- 
tion of elements and their multiplication by scalars are defined in terms 
of these operations acting on components. 

The tensor product S & T of a p-tensor S and a q-tensor T is in- 
trinsically defined: it is the (p + q)-tensor with components defined by 
products of components. For example, the tensor product of a covariant 
vector w with a contravariant 2-tensor T is the mixed 3-tensor w @ T 
with components 


Tpz ” m (1.3.1) 


(w D T)9* = uT. (1.3.2) 


Although products of components are commutative, tensor products are 
non-commutative: w ® T and T & w of the previous example do not 
belong to the same vector space. A p-covariant and q-covariant tensor is 
an element of the tensor product of p copies of Tẹ, and q copies of T™, 
but different orders of these products give different spaces of tensors: 
T* @T, #T, ® Te. 

A basis of the vector space of tensors of a given type is obtained by 
tensor products of vectors and covectors of bases. For example, a natural 
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6See, for instance, CB-DMI, III C. 


basis of the space of covariant 2-tensors at x is dx’ & dx: 
T = T,;dx' Q dz’. (1.3.3) 


The symmetry [respectively antisymmetry] properties, for ex- 
ample Ti; = Tj; [respectively Ti; = —Tj;], are intrinsic properties (i.e. 
independent of coordinates). 


Exercise 1.3.1 Check these properties. 


The contracted product of a p-contravariant tensor and a q- 
covariant tensor is a tensor of order p+ q — 2 whose components are 
obtained by summing over a repeated index appearing once upstairs 
and once downstairs. 


1.3.2 Tensor fields. Pullback and Lie derivative 


Tensor fields are assignments of a tensor at x to each point x of the 
manifold V. Differentiability can be defined in charts; the notion of a C* 
tensor field is chart-independent if the manifold is of class C*+?. 

The image of a contravariant tensor field on V under a differentiable 
mapping between differentiable manifolds f : V — W is not necessarily 
a tensor field on W unless f is a diffeomorphism. 


Exercise 1.3.2 Prove this statement in the case of a vector field. 


The pullback f* on V of a covariant tensor field on W is a covariant 
tensor field. For instance, for a covariant vector w and a mapping f given 
in local charts by the functions y% = f%(x*), with y% coordinates on W 
and zt coordinates on V, the pullback is defined by 

x oy” 
(Fwi) = Frwalula). (1.3.4) 


Exercise I.3.3 Extend the definition and property to covariant 
tensors. 


The Lie derivative of a tensor field T with respect to a vector field 
X is a derivation operator from p-tensors into p-tensors. If f; is the one- 
parameter local group of diffeomorphisms? generated by X, then the Lie 
derivative at x € V of a contravariant tensor T is defined by 


(LxT)(x) = im[( F Y T(E)) — T2). (1.3.5) 
For example, for a contravariant 2-tensor, it is computed to be 
; ,OTI* p OXI wi” 
T = X _ — TH _ Ph, 1.3. 
(ExT) Ox? Ox* Ox* (12:0) 


The Lie derivative of a vector field X in the direction of the vector field 
Y is called the Lie bracket of X and Y; it has components 


(LxY)Í = |X, YP = XŻaY I — Y'AXİ. 


An analogous definition gives Lie derivatives of covariant tensors. For 
instance, for a covariant 2-tensor, 


_ Oe, ee OX? „ðX! 


Exercise 1.3.4 Check that the quantities (3.6) [respectively (3.7)] 
transform as the components of a 2-contravariant tensor [respectively 
a 2-covariant tensor] under a change of coordinates. 


(1.3.7) 


1.3.3. Exterior forms 


A totally antisymmetric covariant p-tensor field is also called an ex- 
terior p-form. A natural basis of the space of p-forms is obtained by 
antisymmetrization. We call the p-antisymmetrization operator the 


mixed tensor with components aes = 1 if %...% is an even per- 
mutation of 12...p and eee = —1 if it is an odd permutation The 
components of a p-form can be written 


12...p 
Wi .ip = Eii ...ip™12...p 
A p-form is determined by the data of its component with indices of 
increasing order. 
We define the exterior product of exterior forms by antisymmetriza- 
tion of tensor products. A basic example is the exterior product of two 


1-forms given by” 
dx’ Adz? = —dx A da" := 5 (da" ® dx’ — dx] ® dr’). (1.3.8) 
Using exterior products, a p-form reads in a natural coframe 


dx A... A dx’, (1.3.9) 


w= ~ Wii...ip 

The exterior derivative of a p-form is a (p + 1)-form given in local 

coordinates by 
1 


; ; 1 0 
dw i= ~ dui, ...ip dT" A... Ndr”? =— 
p! 


I Jrg iio AO? Adr” A... Ndr”. 


(1.3.10) 


A form whose differential is zero is called a closed form. A form that 
is the differential of an exterior form is called an exact form. An ex- 
act form is a closed form. The reciprocal of this property is true on 
manifolds diffeomorphic to R” (Poincaré lemma), but not on general 
manifolds. A necessary and sufficient condition is given by the de Rham 
theorem. 


Example 1.3.1 Any n-form on the compact n-manifold S” is closed, 
but is an exact differential only if its integral on S” vanishes. 
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TNote that there is an alternative def- 
inition of the exterior product in which 
the factor of 4 is omitted: dx’ A dxi = 
dx* Q dxi — dxi Q dx’. 


8The sufficient condition is that the in- 
tegral of the p-form is zero on p-cycles. 
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The interior product of a p-form with a vector field X is the 
(p — 1)-form given in local coordinates by the formula 


1 a 
(ixw)ir...ip-1 = eo Whit. igsi? 


Lemma I.3.1 The Lie derivative of an exterior form w with 
respect to a vector field X is given by the formula 


Lxw =ixdw+dixw). 


Exercise I.3.5 Prove this formula for 2-forms by using local coordin- 
ates. 


I.4 Structure coefficients 
of moving frames 


For the natural coordinates x? in a chart of a manifold, which are func- 
tions « +> qf, it holds that Ox'/Ox) = 6%, d?x' = 0. However, for a 
moving coframe in a domain U, the differentials of the 1-forms 6* do not 
vanish in general: they are given by 2-forms 


. ice, o3 
do" = — 5 Cin" A a (1.4.1) 


The functions Ci n on U, antisymmetric in j and k, are called the 
structure coefficients of the frame. 

The Pfaffian derivatives 0; in the coframe 6 of a function f on U are 
defined by df = 0;f 0°. The identity d? f = 0 implies that 


hence, Pfaffian derivatives in contrast to ordinary partial derivatives, do 
not generally commute, 


Exercise [.4.1 Show that the structure coefficients of a coframe 
0 := ajdx/ are given by 


i — Ajg „i jg „i 
Chr = Apna; — A Oha; 
where A is the matrix inverse of the matriz a. 


We note that the basis e; dual to 6° satisfies the commutation 
conditions 


where [.,] denotes the Lie bracket of vector fields: 


[v, w] := Lyw = —Lyv. 


I.5 Pseudo-Riemannian metrics 


1.5.1 General properties 


A pseudo-Riemannian metric on a manifold V is a symmetric covari- 
ant 2-tensor field g such that the quadratic form it defines on contravari- 
ant vectors, g(X, X), given in local charts by g;; X’ X7 , is non-degenerate; 
that is, the determinant Det(g) with elements gij does not vanish in any 
chart; this property is independent of the choice of the charts because, 
under a change of local coordinates (x) — (a), it holds that 


Det(g) = Detto’) ( FEL) (1.5.1) 


The inverse of the matrix (gij) is denoted by (g’’). 


Exercise I.5.1 Show that the Kronecker symbol 5, & = 1 fori = j, 
6! =0 fori Æ j, is invariant under change of frame. 

Show that the g’ are the components of a contravariant symmetric 
2-tensor. 


A metric is traditionally written in a moving frame: 
g = 9i30'0", ie., in a natural frame, g=gijdr'dx?. (1.5.2) 


It is known from algebra that any non-degenerate quadratic form over 
the reals can be written as a sum 


gijdxîdzi = X` e;(0)? with ei =4 


i 


1, (1.5.3) 


where the 6t are independent real linear forms of the dz’, that is, a 
moving frame. Given g, this decomposition can be done in many ways, 
but the number of c; that are equal to +1 and the number that are equal 
to —1 are independent of the decomposition; this is called the signature 
of the quadratic form. A moving frame where a metric takes the form 
(5.3) is called an orthonormal frame, whatever its signature. 

A pseudo-Riemannian metric g on V defines at each point x € V 
a scalar product, that is, a bilinear non-degenerate function on the 
tangent space TV: 


(v, w) := ga(v, w) = gij(x)v'w?, v, w €TV. 


Two vectors of a tangent space T,,V are said to be orthogonal if their 
scalar product vanishes. 

Through contracted products with the metric or its contravariant 
counterpart, one associates canonically contravariant and covariant ten- 
sors, for example 


Tij => Gihgjk T”. (1.5.4) 


An isometry of the pseudo-Riemannian manifold (V, g) is a diffeo- 
morphism f that leaves g invariant; that is, 


(f*g)(x) = g(x) at each point x € V. (1.5.5) 
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°For integration of forms on manifolds, 
see for instance CB-DMI, IV B 1. 


10That is, e; = +1 for all i in the 
formula (5.3). 


If a metric g is invariant under the action of a one-parameter group 
generated by the vector field X, then its Lie derivative with respect to 
X vanishes; the converse is locally true. The vector field X is then called 
a Killing vector of g. 


Exercise 1.5.2 Show that the Lie bracket of two Killing vectors is also 
a Killing vector. 


A pseudo-Riemannian manifold of dimension n admits at most 
n(n + 1)/2 Killing vectors. 

Two pseudo-Riemannian manifolds (V,g) and (V’,g’) are called lo- 
cally isometric if there exists a differentiable mapping f such that any 
point z € V admits a neighbourhood U, and f(x) a neighbourhood U’, 
with (U,g) and (U’, 9’) isometric. Locally isometric manifolds have the 
same dimension but can have different topologies (for example, a plane 
and a cylinder). 

A pseudo-Riemannian manifold is called a flat space if it is isometric 
with a pseudo-Euclidean space, that is, R” with metric 


gigdx' dex! = yates with ¢; =1. 


It is called locally flat if it is locally isometric to a flat space. 
The volume form of the metric g is the exterior n-form that reads 
in local coordinates 


Wg = [Detg|? dx! A...A dz”. (1.5.6) 
This exterior form induces, on domains oriented by the order z!,...,2”, 
a volume element on a pseudo-Riemannian manifold, often denoted 
by the same symbol w,, although the volume element concerns only 
oriented manifolds. It reads, 


Lg = \Detg|? dz! sade", (1.5.7) 


the order of the differential dz’ is irrelevant in integration (Fubini’s 
theorem).? 


I.5.2 Riemannian metrics 


A metric is called Riemannian (or, for emphasis, properly Riemannian) 
if the quadratic form defined by g is positive-definite.!° 

If v is a tangent vector at x to a Riemannian manifold (V,g), the 
non-negative scalar 


a (v,v)? = [gij(x)v'v |? (1.5.8) 


is the norm (in g) of v;it vanishes only if v is the zero vector, whose 
components are v’ = 0. 


I.5 Pseudo-Riemannian metrics 13 


The length ¢ of a parametrized curve \+> 2(A) joining two points of 
V with parameters À = a and A = b is 


dx’ dxi 2 


Elementary analysis shows that the definition is independent of the par- 
ametrization. The distance between these two points is the lower bound 
of the length of arcs joining them. A geodesic of a properly Riemannian 
manifold is a curve for which this bound is attained. 

Distance endows a properly Riemannian manifold with the structure 
of a metric space with a topology that coincides with its topology as 
a manifold. Riemannian manifolds and their topologies have been ex- 
tensively studied by differential geometers, and their metrics are closely 
related to their topological properties. They are of interest to General 
Relativists as possible properties of space (see Chapter VII). 


I.5.3 Lorentzian metrics 


A pseudo-Riemannian metric g is called a Lorentzian metric if the 
signature of the quadratic form defined by g is (— ++--++). In the case 
of a manifold with a Lorentzian metric, we denote its dimension by n+ 1 


and use Greek indices a, 8,... = 0,1,2,...,n for local coordinates and 
local frames. A Lorentzian metric is a quadratic form 
9 = Japdz” dz? (1.5.10) 


that admits a decomposition as a sum of squares of 1-forms 0° with the 
following signs, independent of the choice of these 1-forms: 


g=- + X (P), 0% =a%sde". (1.5.11) 

i=1,...n 
Remark 1.5.1 We adopt the ‘mostly plus’ MTW convention.!! Some 
authors prefer the opposite convention, giving to Lorentzian metrics the 
signature (+——...—). Each of these conventions has its advantages and 
disavantages, but of course they give equivalent geometrical results (note, 


however, that, surprisingly, this is not always true on non-orientable 
manifolds"? ). 


A metric of fundamental physical importance in the case n = 3 is the 
Minkowski metric. For a general n, it is the flat metric on R"*+ that 
reads 


g=—(dx)?? + So (dx'y?. (1.5.12) 


i=1,2,...,.n 
The group of linear maps of R"*!, 
9% = L260, (1.5.13) 


that preserves orthonormal frames (i.e. frames 0° where the metric takes 
the form (5.12)) is called the Lorentz group L”*?. 


11 Misner—Thorne—Wheeler. 


12See CB-DMII Chapter I Section 7, 
Pin and spin groups. 
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I.6 Causality 


I.6.1 Causal and null cones 


At each point x of a Lorentzian manifold V, one defines in TV, the 
tangent space to V at x, a double cone C, called the causal cone, by 
the inequality 


ga(v,v) <0, v ETV. (1.6.1) 


The boundary of C, in TeV is the double cone F}, called the null cone 
or light cone, 


Ga (v,v) = 0. (1.6.2) 
A vector v € T,v is called causal if 
Ga (v,v) < 0. (1.6.3) 
It is called timelike [respectively spacelike] if 
Gx(v,v) <0 [respectively gz(v,v) > 0]. (1.6.4) 
It is called a null vector, not to be confused with the zero vector, if 


Ja(v,v) =0 (1.6.5) 


1.6.2 Future and past 


The causal cone C, splits into two convex cones, C and C7, charac- 
terized in an orthonormal Lorentzian frame by the properties v € Cy 
and 


Cr: Ow)sex so, Cl: vw <0. (1.6.6) 


If it is possible to choose this splitting continuously on V, the manifold 
is said to be time-orientable. It is time-oriented by the choice. Unless 
otherwise stated, the manifolds called Lorentzian manifolds are oriented 
manifolds with time-oriented Lorentzian metrics. 

A causal vector is future- [respectively past-] oriented if it is such 
that v € Cy [respectively v € C7]. 

A curve y joining Ta to £, in a manifold V is the image in V of a 
segment of R, A y(å) € V, a < àA < b. The tangent to y at a point 
7(A) is the derivative dy/dA, a vector in Tya) with components dy*/dA 
in local coordinates. 

In a Lorentzian manifold (V, g), a future causal curve is a curve with 
future causal tangent vectors, and a future timelike curve has future 
timelike tangent vectors. The future of a point x is the set of points 
y € V that can be reached from x by a future timelike curve. The future 
of a subset is the union of the future of its points. 


1.6.3. Spacelike submanifolds 


Let M be an n-dimensional differentiable submanifold of the (n + 1)- 
dimensional differentiable manifold V. Assume that f is defined by an 
equation f = 0, with f a differentiable function with non-vanishing 
gradient. Then, we denote by v the contravariant vector associated in 
the metric g to the gradient of f at x € V. 


Exercise I.6.1 Show that v is orthogonal in g to vectors v in TaM. 


The submanifold M is called spacelike [respectively, timelike, null] if 
its normal v is timelike [respectively, spacelike, null]. 
Exercise [1.6.2 Show that in a small enough neighbourhood, local co- 
ordinates of a Lorentzian manifold can always be chosen such that the 


time lines are orthogonal to spacelike submanifolds; that is, the metric 
can be written 


ds? := —N*(dz°)? + gijdx'dx 


1.6.4 Length and geodesics 


The length of a causal curve y joining £a to Zp is 


b dy dy® 3 
l= — o : I.6. 
f ( Jap x) d) (1.6.7) 


Null curves have zero length. 
To define geodesics on a Lorentzian manifold, we replace the inte- 
gral (6.7), which causes problem at points where the curve has a null 
tangent vector, by the integral, which is no longer independent of the 
parameter A, 
dx 


b 
pen with L:=ga gl (A)) ttf, pe D 


A geodesic joining x(a) and 2(b) is defined as a critical point of this 
Lagrangian, that is, a solution of the Euler equations 
d OL OL 
—~— —- ~—— = 0. 1.6.9 
dOr% Ox ( ) 


These equations read explicitly 
d?z8 (2 O9a8 2m) dz? dx? 


(1.6.8) 


29a3 ae T 


dx’ rej dd dA 
that is, because of the symmetry in the indices 8 and y, 
Par +72 dec dz >20. To: l ap Aga Jay _ gp 
dà? aN dà ° PY 2 ðrı ðr? = Ax 
(1.6.10) 


Lemma I.6.1 On a curve solution of the equations (6.10), it holds that 
gag(x(A))#°%" = constant (1.6.11) 
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Proof. Equation (6.11), as can easily be checked directly, is a special 
case of the energy equality satisfied by solutions of the Euler equations 
of a Lagrangian: 

OL 


L- grt = constant. (1.6.12) 


The above lemma shows that the definition (6.10) of geodesics defines 
not only geometric curves but parametrized ones. It implies that the 
critical points of the functional (6.10) are also critical points of (6.7) 
but the converse is not true. 


? 


Exercise I.6.3 Prove this statement. 


It can be proved that a geodesic between two points that are causally 
related realizes, in the Lorentzian case, a local (i.e. among nearby such 
curves) maximum of distance. 


Exercise I.6.4 Check this statement by drawing in a plane with Car- 
tesian coordinates (x, t) a broken causal line joining the origin (0,0) to 
a point (0,a) and comparing the Minkowskian lengths of this line and 
the straight line joining the same points. 


I.7 Connections 


I.7.1 Linear connection 


Partial derivatives of components of tensor fields are not components 
of tensor fields — they do not transform as such under a change of co- 
ordinates. Geometric derivation operators are defined by endowing the 
manifold with a new structure called a connection. These operators are 
called covariant derivatives and are usually denoted by V. They map 
differentiable vector fields into 2-tensor fields and obey the following 
laws: 


V(v + w) = Vv + Vw additivity, (1.7.1) 


V(fv) = fVu+df@v_ Leibnitz rule for a product with a function. 
(1.7.2) 


When acting on a scalar function, they coincide with the differentiation 
operator; that is, 


Vf :=df, a covariant vector. 


Consider in the domain U of a chart a vector field v = vte; in an 
arbitrary frame. By the previous rules, one has 


Vu =v'Ve;, + diñ Q e, (1.7.3) 


and hence Vv is determined in U by the covariant derivatives of the 
basis vectors e;, 2-tensor fields that we write in the chosen frame as 


Ve; = w0 @ en. (1.7.4) 


The wh are functions of the local frame on U. They are called connec- 
tion coefficients. The covariant derivative of a vector reads, by (7.3) 
and (7.4), 


Vv = (Vjt) Qei, with Vjv = ôjvt + wipo". (1.7.5) 


The connection coefficients wh are not components of a tensor;!? their 
transformation law under a change of local frame results from the 


tensorial character of Vv and is found to be 
wh, = ARO, A + AR AT Af uh, (1.7.6) 


where A is the change-of-frame matrix. In the case of a change of natural 
frame, it holds that 


; Oa! 
Ai, = —. 
4 Ox" 
Exercise I.7.1 Prove the connection frame-change formula (7.6). 


Show that the difference of two connections on a manifold V is a tensor 


on V. 


The covariant derivative of a covariant vector is defined so that the 
covariant derivative of the scalar vu; is the ordinary derivative and 
obeys the Leibniz rule: 


Vj(v'u;) =0;(v'u;) = (Ojv')u; + v'(Oj;u;) and 
Vi(v'ui) =(V jv Jui +v (V jus). 
It then follows from (7.5) that 
Vjui := jui — wuh. (1.7.7) 


The covariant derivatives of a (contravariant) vector v or a covariant 
vector u in the direction of another vector w are vectors of the same 
type V,v and V,,u with components 


wViv? and w'Viuj. 


Remark I.7.1 The set of numbers Vivi or Viuj defined as com- 
ponents of 2-tensors can also be interpreted for any given i as the 
components of the vector or covector that is the derivative of v or u 
in the direction of the frame vector eçi). 


The property that the covariant derivative in the direction of a vector 
w of a contravariant or covariant vector is also a contravariant or covari- 
ant vector, respectively, allows us to define the covariant derivatives of 
tensor fields by assuming that, as is usual for derivation operators, they 
satisfy the additivity condition and also obey the Leibniz rule, but here 
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13They define a matrix valued 1-form 
wi given by wł := wh. 
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14One cannot apply the Leibniz rule 
directly to tensor fields, because of the 
non-commutativity of the tensor prod- 
uct: Vu @v and u® Vv do not belong 
to the same vector space. 


only for derivation in the direction of a given vector,!* equivalently on 
representatives (i.e. components). We have, for two arbitrary tensors, 


ValS +T) = VS + ViT linearity, 
Vi(S@T)=V,S@®T+S@Vi.T Leibniz rule for tensor product 
and the derivation V,. 


These general definitions give, for example for the 2-covariant, 2- 
contravariant tensor 


THT" 8 Ge, Gee0, ViT = Ve T 


the components of its covariant derivative, a 3-covariant, 2-contravariant 
tensor 
hk hk m hk h mk k hm mmp hk 
Vily = Tj — wij Im + Wim TG + Wim dj "i = wa Ty m- 


Remark I.7.2 Lie derivatives obey the Leibniz rule 
Lx(S@T)=LxS@T+S@LXxT. 


I.7.2 Riemannian connection 


The Riemannian connection of the pseudo-Riemannian metric g is 
the linear connection w such that 


e The covariant derivative of the metric is zero; that is, 
On9 By — WAIBA — Wagga = 0. (1.7.8) 
e The second covariant derivatives of scalar functions commute: 
VaIef —Ve0af = 0. (1.7.9) 
One says that the connection has vanishing torsion. 


Theorem I.7.1 The conditions (7.8) and (7.9) determine one and 
only one Riemannian connection, given by 


wey = T8, + g Dayu (1.7.10) 
with 
: 1 
Baru = 5 (Gray — Ir Cay — GarC3y)s (1.7.11) 


where the Ce. are the structure coefficients of the moving frame, 
vanishing in a natural frame, while 


1 
ri, = 39” agyu F Os Qe = Oriaz): (1.7.12) 


The quantities T are called the Christoffel symbols of the metric g; 
they are zero for an orthonormal frame and, more generally, when the 


I.8 


Jag are constant. In a natural frame, the connection coefficients reduce 
to the Christoffel symbols, which then read 


ma 1 pu (fee p Wax fen.) (1.7.13) 


2 ox% ox Oxt! 


Applying the general formula (7.6) to a change (x%) + (a’’) of local 
coordinates in a Riemannian connection gives 
B 3x Ax® Ox? 8 dr? ð Ox? 
oy Brb OLV OLV © Ax? Axe’ (Orv j” 
Proposition I.7.1 By a suitable choice of local coordinates, the con- 


nection coefficients can be made to vanish at any one given point, and 
even along any one given curve. 


(1.7.14) 


Exercise 1.7.2 Show that these properties hold. 


This proposition is very useful to remember when computing tensorial 
expressions containing covariant derivatives. 


Exercise I.7.3 Prove that a Killing vector field satisfies the equations 
Vi Xj + Vj Xi = 0. 


I.8 Geodesics, another definition 


1.8.1 Pseudo-Riemannian manifolds 


Parallel transport: a vector field v is said to be parallely transported 
along a curve A +> 2(A) with tangent vector u if it satisfies along this 
curve the differential equation 
P B a _ ax 
U Vau” =0, u= a 
A differentiable curve is called a geodesic if its tangent vector is 
parallely transported: 


(1.8.1) 


u°Vau? = 0; (1.8.2) 
that is, in local coordinates, if 
dz% (du? 
Pr (= + ru’) =0 (1.8.3) 


or, equivalently, 
drê B dx“ dx? 
+ => 
dd? YVAN dX 
Lemma 1.8.1 The scalar 


0. (1.8.4) 


et. dx“ dz? 
T 


is constant along a geodesic with parallely transported tangent u®. 
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Proof. Along a geodesic, it holds that 
4 d 
urugVau® = u*Va(ugu’) = —(ugu®) = 0. | 
It should be noted that the given definition of geodesics concerns par- 
ametrized curves—the property (8.2) is not conserved under a change 
of parameter; indeed, if X = f(A), u” := da*/dX and u” := dx*/dX, 
then 


dX dd? (dN) 
and (8.4) becomes 


du% _ d?a* a” (dN \* | de® @N 
dÀ dN (dd)?’ 


dN 2 2 ma wt a d2 , 
À dg rs da dx | dx À =0, (18.5) 
dd} | (dX’)? YAA dd | dX (dd)? 
or, equivalently, 
a a . d?n dX 2 
u Vau? + c(A)u'® =0 with e(A):= CE (=) f 


Therefore, u’® is parallely transported only if d?X’/(d\)? = 0, i.e. X = 
aà + b; that is, A and A’ are linked by an affine transformation. The 
parameter À on a geodesic, with u® = dz®“/dà parallely transported, 
is called a canonical affine parameter. The geodesics considered in 
Section I.6.4 are canonically parametrized. 


I.8.2 Riemannian manifolds 


In a Riemannian manifold, a geodesic joining two points attains the 
minimum length among neighbouring curves joining these points. 


I.8.3 Lorentzian manifolds 


Since u”%ua is constant along a geodesic, the timelike, null or spacelike 
character of a geodesic is the same along the whole curve. The canonical 
parameter À is proportionnal to arc length if the curve is timelike. 

In contrast to the Riemannian case, we have the following theorem. 


Theorem I.8.1 In a Lorentzian manifold, a timelike geodesic join- 
ing two points attains the maximum length among neighbouring curves 
joining these points. 


I.9 Curvature 


I.9.1 Definitions 


The non-commutativity of covariant derivatives is a geometrical prop- 
erty of the metric, manifesting itself via its Riemannian connection. 


The commutation (VaVg — VeVi of two covariant derivatives of 
a vector v is a 2-covariant, 1-contravariant tensor that is found by cal- 
culus to depend linearly on v, and not on its derivatives; that is, there 
exist coefficients Rag”, such that 


(VaVe— VeVa)u® = Rap rv". (1.9.1) 


These coefficients are the components of a tensor,!? Riem(g), antisym- 
metric in its first two indices, called the Riemann curvature tensor.!® 
The identity (9.1) is called the Ricci identity. 

It can be proved!” that a manifold with vanishing Riemann curvature 
is locally flat. 

Straightforward calculus using the expressions for the components in 
natural coordinates of the covariant derivative of a vector shows that 
the components of the Riemann tensor in natural coordinates are 


0 Ot as LTA Te A 
= mA ies T (1.9.2) 


Si A 
Rag =A Piu 


The Ricci tensor Ricci(g) is defined by the contraction a = À of the 
Riemann tensor and has components 


o o 


— Q a a A 
Rey, = Jra Bu ggh en t Pay an TapE (1.9.3) 
The scalar curvature is 
R := g Rag. (1.9.4) 
The Einstein tensor!® is 
1 
Saa i= Rag — g Gap. (1.9.5) 


Exercise [.9.1 Show that in an arbitrary moving frame, 
Ryu g = dawro — ww + WLW g — Wapige — Wag (1.9.6) 


Remark I.9.1 The Riemann curvature tensor is an exterior 2-form 
taking values in the set of linear maps from the tangent plane to itself. 


1.9.2 Symmetries and antisymmetries 


The Riemann tensor is obviously antisymmetric in its first pair of indices. 
It is straightforward to show that when written in full covariant form 
Rapp = Op Raw i 
it is also antisymmetric in its last pair of indices and satisfies the so-called 
algebraic Bianchi identity 
Rapan + Rear ap + Rro, by = 0. (1.9.7) 


This identity can be used to show that the Riemann tensor is invariant 
under the interchange of these two pairs. One says that it is a symmetric 
double 2-form. 

The Ricci and Einstein tensors are symmetric 2-tensors. 
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15 Because the left-hand side is a tensor 
and v is an arbitrary vector. 


16Note that the sign convention for the 
Riemannian curvature tensor in (1.9.1) 
is that used in CB-DMII; an alterative 
convention is used in CB-DMI. 


17See CB-DMI, V B 2. 


18Denoted Gag by some authors. 
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19One also says that the right-hand 
side is the relativistic tidal force. This 
terminology is inspired by the fact that 
tides on Earth are due to the differen- 
tial of the gravitational attraction (the 
Hessian of the gravitational potential in 
the Newtonian approximation) mainly 
of the Moon, on the seas. 


1.9.3 Differential Bianchi identity and contractions 


The definition of the Riemann tensor implies the differential Bianchi 
identity 


VaRgyan + VeRyarpu + VyRaeryu = 0. (1.9.8) 
Contracting this identity gives the identity 
VaRpy” u — VeRyy + VyRgy = 0, (1.9.9) 


and a further contraction gives the following identity satisfied by the 
Einstein tensor: 


Vas =0. (1.9.10) 


This identity, called the conservation identity, plays a fundamental 
role in the choice of the Einstein equations. 


I.10 Geodesic deviation 


An important phenomenon that signals to observers the curvature of 
spacetime is its influence on the distance between nearby geodesics. 

Let C, be a one-parameter family of geodesics with canonical par- 
ameter denoted by s, Co(s) := v(a,s). Denote by v = ðy/ðs the 
tangent vector to Co and by X = Ow/0o the vector that characterizes 
the infinitesimal displacement of Cg. 


Lemma I.10.1 These two vector fields commute; that is, 
V°Va XÊ — X°V av? =0 (1.10.1) 


Proof. The relation is tensorial and pointlike. It is sufficient to prove it 
at an arbitrary point in particular coordinates. Choose coordinates such 
that at that point the Christoffel symbols vanish. Then 


O28 Ow? 
OVX? — X°V yu? = - =0. 
oN Vav ðsðo ðsðs g -= 


Theorem 1.10.1 The rate of acceleration of distance between nearby 
geodesics is linked with the curvature by the equation 
D? 4 
Ved? = XP = Xv Rya ps (1.10.2) 


Ds? 


called the equation of geodesic deviation.'® 


Proof. Differentiating the relation (10.1) in the direction of v gives, using 
the parallel transport of v, 


VVV Vad? — VVA X V gu? — X%°V,Vau" = 0. (1.10.3) 


Hence, using the Ricci identity, 
VAUVA VaR? - Vi X Vav? — X%y(VoVav? + Rya pv") =0. 
(1.10.4) 
and hence, by the parallel transport of v, 
VUV Vad? — PV X V gu? + XPV gw *V 08 — XU Ryo? yo" = 0. 
(1.10.5) 


The commutation (10.1) of v and X shows that this equation simplifies 
to 


V` VAV a XÍ = Xv Rya p, (1.10.6) 
which can also be written as 
D? a L 
eet te par” = Xw" Rial i (1.10.7) 


I.11 Linearized Ricci tensor 


The linearization,?° also called first variation, of an operator 


P : u P(u) between open sets of normed vector spaces E, and E>, 
at a point u € Fj, is a linear operator acting on vectors du € E> given 
by the (Fréchet) derivative P/ of P at u, that is, such that 


ôP := P' ôu, with P(u+ du) — P(u) = Pi(u)du+ o(|õu|). (1.11.1) 


We consider a pseudo-Riemannian metric, with components gag in 
local coordinates. The relation between gag and the inverse matrix g°®, 


ggap = 53, the Kronecker symbol, (1.11.2) 
implies that 
ôg% = —h** := =g g hyu where we have set hy), = Ogap- 
(1.11.3) 


The definition and the above relation applied to the Christoffel sym- 
bols and the Ricci tensor gives by straightforward computation that 
or g is the following 3-tensor (indices raised with gry: 


1 
Tag = 3 (Vahh + Vaha — V*Rag) - (1.11.4) 


From this formula and the expression for the Ricci tensor, it follows that 
6 Ricci(g) is the linear operator on h := 6g given in local coordinates by 


1 1 
Rag = -3 V°Vahap P (Va Vahh + VaVgha — VaV gh). (1.11.5) 


From this identity, there results the linearization of the scalar curvature: 


ÖR = gP ERa + Ropdg™, 
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20See, for instance, CB-DMI, II A, in 
particular Problem 1. 
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where g°?6R,yg is the divergence of a vector, namely, 


gP bRag = -Va (V*hE —Vah**). (1.11.6) 


1.11.1 Linearized Bianchi identities 


The contracted Bianchi identities 


Vas =0 
imply 
6(V 5°") = 0; 
that is, 
ValS SE) + SATE, + 8° Ore, =0, 
with 


1 
Sò = 5R = 5 (9 "OR +h™ R) =0. 


I.12 Physical comment 


We shall see in the following chapters that in Relativity (and already 
in Newton’s theory) a model for reality at a macroscopic scale is based 
on differentiable manifolds, geometric objects whose elements are called 
points. Each point of, let us say, a 3-manifold is represented by a family 
of sets of three numbers, each set being the coordinates of the point in a 
reference frame; different elements of the family are linked by an equiva- 
lence relation between reference frames. The physically realistic problem 
is to link the abstract reference frames with a concrete observable one. 
We will return to this subject in the following chapters. 


I.13 Solutions of selected exercises 


Exercise I.3.2 Image of a vector field 


Let f: V => W by x > y := f(x), and denote by (y*) coordinates on 
W and by (zt) coordinates on V. The differential of f at x is a linear 
map between T,,V and T,W, denoted by f’, such that 


(oto) = Evet). 


It is not a vector field on W if the inverse mapping of f, f 7t : y > x(y), 
is not defined on W. 


I.19 Solutions of selected exercises 


Exercise I.3.4 Components of Lie derivatives 
For a covariant 2-tensor, the components of the Lie derivative are 


ax! 3X 
xi J” ah” 


(LxT)jn = X’ on a a (1.13.1) 
T 


Under the change of coordinates zê = ¢'(y"), we have X’ = Ai, XS", 
with 


with (AZ) the inverse matrix of (Ai,). Straightforward computation 
gives 


; į ,O A? ATT, , Pee 7 i, m’ 
(ExT )jh =A X" A? ( J h pa’) i ae AY Tuq A O(At, X ) 


Oy” Oy?” 
PEE a( Ai Xm) 
p n h m 
+ A; Aj Ty'n’ Ag Oy 
Since AÏ Ai, = 5, the terms containing no derivatives of the A are 
easily seen to reduce to 
rt Hore ax” ax” roa 
p q m pq ji I = Pp q 
A; Aj, (x ym’ Lmg yr" rpm yt = A; Aj, (LxT ) pq. 
(1.13.2) 
The terms containing derivatives of the A are 
, OAR’ AT) Kd „ðA 
XO ae AA Ae AE 
pq 3y” i h LS ai OyP 
f f 1 1 ðA! 1 
P An q m m 
+ Aj A; Tpw Ah X Oy? , 
which can be written 
„(AP AT) P” „ðA 
KE Tor Ae A Tyg A KA am 
pq 3y” ay i h on Oy?’ 
1 f 1 $ OA? , 
P An q m m 
+ Ay A; Tp Aj, X Oy ` 
Renaming indices then gives 
, [a(a®’ AZ) + + OAR sof OA! 
m J h p q n m! p q n m! _ 
Tyg X “ay + Aj Aj, A; 3y” + Aj Aj h ayn’ = 0. 


Since partial derivatives commute, it holds that 
ðA _ Oat _ OAt, 
Oy™ hai Oy” Oy™ Oy ? 
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and we can write 


, [ac A? AZ) rain Oe gt OAt 
m d h n n! n ee EES 
Tyg X “ay + A? At Aj ayn + At AJ Aj ayn = 0. 


. > 1 . . 
But, since A‘, and A} are elements, of inverse matrices, 


1 ðA, , ðA” a AP 
AP An’ Son! _ gr’ ai, = 
t J Oy’ a n Oy Oy’ 


An analogous computation replacing 7 by h completes the proof that 


(LxT) jn = AP AP (LT) pray 


Exercise I.3.5 Lie derivative of exterior forms 


On a two-dimensional manifold, the maximal order of an exterior form 
is 2: 
Ï , ' 
wis qwigda A dxt = wiodzx' A dx? 
The general formula for the Lie derivative of a covariant tensor gives for 
Lyw the 2-form 


(ene = x pon Hana (=) = x (X*uy2), (1.13.3) 
while 
ixw = wy2(X'dx? — X7dz), dw =0. 
Hence, 
d(ixw) = ee) | Me) ^ da? 
and 


Lxw= d(ixw) = d(ixw) +ixdw. 


Exercise I.5.1 Kronecker symbol and contravariant 
components of g 


A mixed tensor X with components Xi := 0% in coordinates x’ has the 
following components in coordinates z’: 


F ; 
xy’ = Ox’ Ox!) ., 
= |. Pm SP j. 
J Ox? Ox J 
Hence, 
J 
xt = Ox ðr’ sË 
i= Oat Oat — 9%" 


ap P “| A š 
because Ox" /Ox' and Ox'/Ox" are elements of inverse matrices. 


I.19 Solutions of selected exercises 


The elements g?” of the inverse of the metric gij are such that 
9 95h = 55, 


and in another coordinate system they are such that 


al pt i. 
g? gyn = 973 
that is, 
vp Ox) Ox? = 6 
Oa Ba 9" = OF 
Hence, 


ih Un! dx} Oa! 
Ox ðr’ 


and the g’” are the components of a contravariant tensor. 


Exercise I.7.1 Connection frame-change formula 


Let 
i pipt i_9.,i i ah fo i! oh! 
v = Aj’, Vjv = Oj +w, Vy = 0j Hwy”. 
r 7 
We have 0; = A} ôy, with Aj and Aj, inverse matrices, and hence 
y -1 $ i : 1 
tAd A, PE i a hh 
Vjv = Aj Oj ( Apv ) + wip Aprv 
a P y oie : , 
— AJ At 9. yt JIA. qt i i h ah 
= Aj Aj Oyu + (Aj pAr + wipAn” . 
But, since Vv is a mixed tensor, 
Vv = A AY; i = Ai Av 0; i ih! 
jv = Aj AG Vy" = Ay Aj (Opu wj” ). 
Identifying these two expressions for V;v" gives the connection change- 


of-frame formula and the tensorial law for the difference of two 
connections, because the v” are arbitrary numbers. 


Exercise I.7.3 Killing equations 


A Killing vector X of a metric g is such that 
(Lxg)nk = X’ digar + Jinde X’ + guiOnX' = 0. 


Using Vignk = 0 gives the result by straightforward computation. 
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1.14 Problems 


1.14.1 Liouville theorem 
1. Let (V,g) be a pseudo-Riemannian manifold with tangent space TV. 
Prove (Liouville theorem) that the volume form 0 in TV given by 
6 := |Detg|dx® Adz! A... A dx” A dp® A dp! A^... A^ dp” 
is invariant under the geodesic flow with tangent vector X; that is, 
with Lx denoting the Lie derivative with respect to X, it holds that 
Ld =0. (1.14.1) 


2. Show that ix@ is a closed form. 


Solution 


In local coordinates (x4) := (a%,p%) on TV, the Lie derivative of the 
exterior 2(n+1)-form 0 with respect to the vector field X = (p,G), that 
is, (X4) = (p%, G®), reads, using the fact that the components of 0 do 
not depend on p, 


P cP Gs 
(£x6)o1...nn-41...2(n41) =P gga M0l...nn+1...2(n-+1) + OA1...2(n41) On 
ax* ox 
+ 90 A...2(n+1) zT +... + fo1...(2n+1)A Bandi 
(1.14.2) 


The expression for 0 gives that 


o o 
gga l0lL-nn+1..2(n+1) = g? (ov) 901...nn41...2(n4+1)3 (1.14.3) 


on the other hand, since @ is antisymmetric, the same index cannot 
appear twice in its components; therefore, the second line of (1.14.2) is 
equal to (recall that the components p“ of X do not depend on 2“) 


x^ aGe 
901...2(n+1) BoA = 9o1...2(n+1) Ga (1.14.4) 


The expression for G gives that 
OG° 

= —213,,p*. 1.14.5 

Ope aP ( ) 


The result Lx0 = 0 follows from the expression for the Christoffel 
symbols. 


2. The 2(n + 1)-exterior form 0 is closed (d0 = 0) on TV, a 2(n + 1)- 
dimensional manifold. The equality Lx0 = ixd0 + dix = 0 implies 
that dix = 0. 
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J.14.2 Codifferential 6 and Laplacian of an exterior 
form 


The codifferential, denoted by 6, of an exterior p-form 


wis geit de” A... A dx? 


on a pseudo-Riemannian manifold is defined to be the exterior (p — 1) 
-form 
Ow = Ss V Wig cccjp_1 dt?! A... A drfe- 
"= (p — 1)! TJ. Jp—1 tee : 
1. Show that 6 and the exterior differential d are adjoint operators in 
the sense that for forms with compact support in an open set of R”, 
0 of degree p+ 1 and w of degree p, it holds that 


(w, 00) = (dw, d0), 


where the scalar product of p-forms in the metric g is defined by 


(w,0)= f iy... Pt? tg, 


with ug the volume element of g. 


2. Show that A := —(dd + dô), called the Laplacian,”! is an operator ?!Note that the Laplacian is some- 
from p-forms into p-forms given by (a hat means that the index is times (for instance in CB-DMI) defined 
absent from the sequence) without the minus sign: A := dd + dô. 


jh h 
Age a = 9 VIV hWi...ip = X Ri, Whis...îq..ip 


1<q<p 


h, j 
-2 X Rii irihia 


r<q<p 


If g is Lorentzian, then ôd + dô is usually denoted by UO, and called 
the wave operator of g (the d’Alembertian if g is the Minkowski 
metric 7). 


Solution 


Take a 1 form w = wadx®; then 


1 F 
dw = 5 (V pwa — Vawa)da® Nda*, dw = —Vau* = -Vwa 


(ddw)g := —V°(Vawe — Vowa), (ddw)g = VeVowa. 
Therefore, the components of the Laplacian of w, a 1-form like w, are 
(Aw)g = V°V awe — V°V gwa + VeVowe} 
that is, using the Ricci formula for commutation of covariant derivatives, 
(Aw) = V°Vaws — R° gow, = V°Vaws — Re*wy. 


The general result is obtained similarly by straightforward computation. 
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?2Bel (1958). 


1.14.3 Geodesic normal coordinates 


The geodesics arcs on a pseudo-Riemannian manifolds (V, g) are, in the 
tangent bundle TV, solutions of a first-order differential system that 
reads in local coordinates (a, v%) 

da dv 

= = B 

AX = v“, a. = ee v7. (1.14.6) 
Classical theorems on such differential systems say that if the coefficients 
are Lipschitzian, i.e. if the metric is C11, then this system has one and 
only one solution, defined in a neighbourhood of A = 0; taking given 
values for A = 0, «°(0) = ag, v” (0) = a®. 


1. Show that if the functions 
g” := o” (A, a") 


satisfy the system (14.6) with ¢°(0,a%) = 0, 06%/OX(0, a") = af, 
then 

ge 

=) ,  k areal number. 


g= := o* (a 5 


Hint: Show that x“ and 2? satisfy the same differential system and 
take the same initial value. 


2. Deduce from the result of Part 1 that there exists a neighbourhood 
of xj such that the functions 


define an admissible change of coordinates (1°) — (y%). Show 
that in the coordinates y“, called normal geodesic coordinates, 
the geodesic arcs issuing from xo are represented by straight lines, 


Aw y“ = aA. Show in particular that in these coordinates the 
Christoffel symbols vanish at xo. 


1.14.4 Cases d= 1, 2, and 3 


One-dimensional manifolds are locally isometric with straight lines. 
For two-dimensional manifolds, the Riemann tensor has only (up to 
sign) one non-zero component, Rj2,12. What are the Ricci tensor and 
the scalar curvature? 
Show that in dimension d = 3, the Riemann tensor is linear in the 
Ricci tensor (see Problem 1.14.7 on the Weyl tensor). 


1.14.5 Wave equation satisfied by the Riemann tensor 


Deduce from the Bianchi identities a system of semilinear wave equations 
satisfied by the Riemann tensor?? when the Ricci tensor is equal to a 
given tensor p. 


Solution 


The Riemann tensor satisfies the Bianchi identities 
VaRgy, rp + VyRag au t+ VeRyarpy = 9- (1.14.7) 
These identities together with the Ricci identity imply that 
VV aR by au + VyV~Rag an + VeV" Ryan + Spy,ru = 9, (1.14.8) 
where 5Sz.y,, is a homogeneous quadratic form in the Riemann tensor: 


Say.ru = { Ry? Rop,ru an R* y6” Rapru + (Rya Rap, pu) 


(A> n) — 18 > a}. 


On the other hand, the Bianchi identities imply by contraction 


(1.14.9) 


VoRgy u + VRay + VoRa u = 0, (1.14.10) 
which can be written, using the symmetry Rag rp = Rdp,ag, aS 
VaR arn + V ultxg — V Rug = 0, (1.14.11) 


and the identities become 


V Vahora F S By An + {V (VL Rag a Va Rap) — (8 = y)} = 0, 
(1.14.12) 
If the Ricci tensor Rag satisfies the Einstein equations 


Rag = Paps (1.14.13) 


then the previous identities become the following quasidiagonal system 
of semilinear wave equations?’ for the Riemann tensor: 


Vo VaR By du + Spy,au t+ Jeyru = 9, (1.14.14) 
where Jgy,,, depends on the sources pag and is zero in vacuum: 


Jay, Ap = Vy(Vupas — VaPus) — (8 > 7): (1.14.15) 


1.14.6 The Bel—Robinson tensor 


The Bel—Robinson tensor associated with the Riemann tensor on 
a four-dimensional pseudo-Riemannian manifold was defined by Bel?* 
through the use of the left and right adjoints of the Riemann tensor 
defined by 


* pDr 1 o * 1 o 
CRi)op,ru = zop Rip (R*)ab an = 5 Moopo Rap» (1.14.16) 


with Nagan the volume form of the spacetime metric. 
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?3These equations are analogous to the 
Maxwell equations for the electromag- 
netic 2-form F. 


24Bel (1959). 
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1. Prove that the left and right adjoints of the Riemann tensor are equal. 
2. Prove the Lanczos identity 


(Ri +* RO Joga = Sogou t+ Sapdar— Sap9er— SarxGaou; (1.14.17) 
with S the Einstein tensor 
Sar = Rar - sian. (1.14.18) 
3. The Bel—Robinson tensor Q is the fourth-order tensor given by 
Qn Re a Rene 
+R ere Re K] 


1 
4 


Show that it is symmetric in its two first and two last indices, and 
under commutation of these pairs of indices. 


4. Show that the Bel-Robinson tensor of a vacuum Einstein spacetime, 
possibly with cosmological constant (Sag = Agap), is conservative; 
namely, that it satisfies the equation 


VaQ P~ =0. (1.14.19) 


It can be proved that the contraction of the Bel-Robinson tensor with 
timelike vectors is positive, thus giving the definition of a positive energy 
density for the Riemann tensor. This is used in some existence proofs 
(see Chapter VIII). 


Solution 


Parts 1-3 follow by straightforward computation (easier to do in an 
orthonormal frame). 
Part 4 uses the Bianchi identities. 


1.14.7 The Weyl tensor 


Two metrics g and g on a manifold V are called conformal if there is a 
positive scalar function Q = e°? on V such that 


gG=e* 9, ie. Gap =e gag, JL =e 22g. (1.14.20) 


Two conformal Lorentzian metrics have the same light cones and hence 
define the same causal structures. 


1. Show that the Christoffel symbols of two conformal metrics are linked 
by the relations 


ry = r3, + Igy, (1.14.21) 
where S is the tensor 


Dhu — Pau = Su = SAOD + 630, — 93,0. (1.14.22) 


2. Show that the Riemann tensors of g and g are linked by 


Ros — Rag n = 538 ap — OD ap + Jando” — Ipua” 
+ (53gap — 5a9an)IpBO?®, 

with 

Bag := Vade® — On 80,9. 


3. Show that the Ricci tensors Rg, and Rg, on a d-dimensional manifold 
are linked by 


Rey — Rep = — gpu V° Oa — (d — 2)V ,Og® + (d — 2) 


1.14.23 
x (0,,€03® = 950° ®0)®) ( ) 

and the scalar curvatures by 
e R- R= —2(d—1)V°0,® — (d—2)(d—1)GO,6. (1.14.24) 


4. Define the Weyl tensor?” by the relation 


1 
Wap,76 = Rap,yô — Fg (985 Ray — Jay Rgs + 9pyRas — Jas Rpa) 


= (93-Gas = Jay9as51) R- 


Check that the Weyl tensor has the same symmetries as the 
Riemann tensor and has zero trace. 

Show that the components Wag®, are the same for two conformal 
metrics. 

Since W is obviously zero for a flat metric, it is also zero if the 
metric is conformal to a flat metric.?° 

5. Show that the Weyl tensor of a three-dimensional pseudo-Riemannian 

manifold (M,g) is identically zero. Show that a three-dimensional 
pseudo-Riemannian manifold with Ricci tensor identically zero is 
locally flat. 


1.14.8 The Cotton—York tensor 
It can be proved that a 3-manifold is locally conformally flat if and only if 


its Cotton tensor vanishes. This tensor is a 3-tensor with components 
given in terms of the Ricci tensor and the scalar curvature of a metric 
Jij by 


1 1 
Craj: = Vi (2x = unk) = Vj (Fa = zouk) k (1.14.25) 


Define a 2-tensor, called the Cotton—York tensor, by 
yti = ser iO nkt 


Show that Y is symmetric, traceless and transverse, that is, such that 
V;Y“ = 0. 
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251t is equal to the Riemann tensor in 
a Ricci-flat space (Rag = 0). It is con- 
sidered that the Weyl tensor embodies 
in some sense the non-Newtonian prop- 
erties of the gravitational field, in par- 
ticular its radiation properties. This 
point of view is supported by the fact 
that the equations for massless fields, 
at least in four spacetime dimensions, 
are conformally invariant. 


26t can be proved that if d > 3, then 
the identical vanishing of the Weyl ten- 
sor implies that the metric is locally 
conformally flat. 
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27 Choquet-Bruhat (2000). 


1.14.9 Linearization of the Riemann tensor 


Write the linearization of the Riemann tensor as a geometric second- 
order operator on hag := gag- 


Solution 
bRag, = CTA, — TA, + E278, — AT? (1.14.26) 
aß BX ga BHO Ər OH apy Bu + bpt ap? 14. 
ð ð 
à = r A a pA À 
bRop u = a Vha = o Ton + Vaa Ta OT T 
= OE gpk ay TT (1.14.27) 


where or g is the tensor 
1 
AV Àg = 5 (Voh3 + Vahd — V*Rag) - (1.14.28) 


Therefore, 6Rag*,, is the tensor 
Oe A à 
SRag™u = VaXåu — Vek 
I.14.10 Second derivative of the Ricci tensor 


The second (Fréchet) derivative, also called the second variation, of 
an operator P at u € E; is, if FE, is a Banach algebra, a quadratic form 
P", in du such that 


6°P := P", (du, ôu), (1.14.29) 
with 
P(u + ôu) — P(u) = Pi(u)du+ ”,(u)(du, du) + o(|dul?). (1.14.30) 
Compute the second variation of the Ricci tensor at a metric g as a 
quadratic form in h := ôg : 


ô Rag = Rig(g)(h,h). (1.14.31) 


Solution 


Straightforward though lengthy computation gives?” 
0 Rog = —h™ [V\(Vahoy + Vohay — Vulag) — VaV ghan] 
1 
-Vah (Vahgu + Vghaun — Vuhag) + 5 Veh” Vahu 
1 
+3 V`hg(Vahpa + Veharx — Vahap) 
+V ABV Rey — VahkV aha. (1.14.32) 
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II.1 Introduction 


Special Relativity, formulated by Enstein in 1905, revolutionized our 
conceptions of time and space.! 


II.2 
II.2.1 


Newtonian mechanics 


The Galileo—-Newton Spacetime 


In the spacetime of Galileo and Newton, the simultaneity of two events 
is a notion independent of observers. Space and time are absolute ob- 
jects,? which exist independently of matter and events that may happen 
in them. The mathematical model of Newton spacetime is the direct 
product of R, where time varies, and space, which is assumed to be a 
Euclidean space E’, that is, R3 with the Euclidean metric, which reads, 
in natural frames that mathematicians call ‘Cartesian’ and physicists 
‘inertial’, 


ds? = 5 (da:*)?. 


i=1,2,3 


(11.2.1) 


To link this mathematical model with observations, one has to iden- 
tify an inertial frame of this Euclidean space with observed objects (see 
Section II.2.3). 

The isometry group of the Euclidean model of space permits the ex- 
istence of solid bodies remaining isometric to themselves under motions. 
In Galileo-Newton spacetime, lengths can be measured by comparison 
with a standard metre, a piece of metal deposited in Sevres. For dis- 
tances that are too large to be compared directly with a copy of the 
standard metre, one can use properties of Euclidean geometry, such as 
triangulation. The existence of clocks measuring absolute time is pre- 
dicted by the periodic phenomena resulting from Newtonian dynamics.’ 


II.2.2 Newtonian dynamics. Galileo group 


The fundamental law of Newtonian dynamics for a particle assumed to 
be pointlike is 


F=my, (11.2.2) 


II.1 
II.2 
II.3 


II.4 


II.5 


II.6 


II.7 
II.8 


II.9 


II.10 
II.11 


Introduction 35 
Newtonian mechanics 35 


The Lorentz and 

Poincaré groups 38 
Lorentz contraction and 
dilation 39 
Electromagnetic field 


and Maxwell equations 
in Minkowski 


spacetime M+ 40 
Maxwell equations in 
arbitrary dimensions 42 
Special Relativity 43 
Some physical 

comments 45 
Dynamics of a pointlike 
mass 46 
Continuous matter 49 
Problems 54 


1See e.g. the historical discussion in 
Damour (2006). 


2Newton’s contemporary Leibniz was 
in disagreement with this postulate. 


3The small oscillations of a pendulum 
are approximately such a phenomenon. 
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4Mach’s considerations partly inspired 
Einstein’s General Relativity. 


5In Paris, 31 h 50 m, as can be verified 
by a visit to the pendulum exhibited at 
the Musée des arts et métiers or prob- 
ably soon to one suspended beneath the 
dome of the Panthéon (its original lo- 
cation). This dependence results from 
the fact that the rotation of axes linked 
with the Earth’s rotation with respect 
to the Copernican inertial system leads 
to what is called in Newtonian mech- 
anics a complementary inertial force. 
The appearance of sin À, equal to 1 at 
the North Pole, is due to the value 
of the vertical (i.e. orthogonal to the 
Earth’s surface) component of the ro- 
tation vector of the Earth, parallel to 
the line between the poles. The hori- 
zontal component does not contribute 
to the complementary inertial force, be- 
cause it is parallel to the velocity of 
the pendulum (see e.g. Bruhat, 1934, 
p. 134). 


where m is a constant, the mass of the particle (called inertial mass), y 
is its acceleration in absolute time with respect to absolute space, and 
F is a phenomenological vector, the applied force. The components of y 
in Cartesian coordinates zt for the particle in the absolute space FÆ? are 
the second partial derivatives f = d?x'/dt?. 
Newton’s law (2.2) is invariant under the following time-dependent 
change of coordinates in Newton’s absolute space E’: 
z’ =g + vtt x, (II.2.3) 
where the v’ and 2, are constants. The corresponding t-dependent Car- 
tesian coordinates system in F? is in uniform translation with respect to 
the original absolute space. All such reference frames are inertial frames. 
The set of transformations (2.3) forms a group, called the Galileo 
group. It had already been remarked by Galileo that a uniform-in-time 
translation of a boat cannot be detected by observers in the hold. More 
generally, all the physical laws of Newtonian mechanics are assumed to 
be invariant under the Galileo group, in the sense that they admit the 
same formulation in all inertial frames. 


II.2.3 Physical comment 


Inertial frames are a physical reality. Someone seating on a disk turning 
with respect to the Earth feels a ‘centrifugal force’ due the fact that 
axes fixed on the disk are not inertial for the Galileo-Newton spacetime. 
More generally, non-inertial coordinates manifest themselves through 
non-vanishing Christoffel symbols of the Euclidean metric of the phys- 
ical space F? in these axes. These non-zero Christoffel symbols cause 
the appearance of what in Newtonian mechanics are called Coriolis or 
inertial forces. So the problem is posed, what is the physical origin of 
the inertial frames? The answer appeared simple for Newton, namely 
the a priori given absolute space and time, but the problem of iden- 
tification with observed reality was there all the same. The answer for 
Mach was that ‘absolute’ space was determined by the matter content of 
the universe. In practice, Mach and Newton agree, in an approximate 
way. Experiments on Earth have shown that Newton’s laws of dynamics 
written in his absolute spacetime are satisfied to a good approximation 
if one assumes an inertial frame linked with the Earth, and to a better 
approximation if the inertial frame, called a Copernican frame, has its 
origin at the center of the Sun and with axes directed to specific stars. 
This was observed by Foucault by recording the oscillations of a pen- 
dulum on Earth in a vertical plane that remains approximately fixed 
for a short time, but rotates through 360° in a time that depends on 
the latitude A, namely® 1 day/sin\ because of the centrifugal force due 
to the rotation of the Earth. When the Solar System was known to be 
rotating in the Milky Way, another inertial frame was considered, which 
led to verification of the existence of a further very small correction for 
experiments made in the Solar System. Astronomers now know that the 


Milky Way itself does not stay still in the universe, but it is impossible 
to determine an inertial frame for the universe if such a thing exists. 
Finally, the physicists of the nineteenth century introduced a mysteri- 
ous medium, called aether, invisible and intangible, though with all the 
properties of a solid.® 


II.2.4 The Maxwell equations in Galileo-Newton 
spacetime 
The Maxwell equations unify the various classical physical laws that 


govern the electric and magnetic fields (E, H). These equations in non- 
inductive media read, in 3-vector notation on R°, 


div E =q, 
div H = 0, 


Coulomb’s law, (11.2.4) 


Gauss’s law, non-existence of magnetic charges. (II.2.5) 


The scalar q is the charge density. These equations are ‘constraints’, i.e. 
they do not contain time derivatives. The following are evolution equa- 
tions, with c, the speed of light, a dimensionless constant that depends 
on the units of space and time (and can be made equal to 1 by choice of 
a relation between these units”): 


10H 
curl E = ae, Faraday’s law, (II.2.6) 
c Ot 
. LOE F 
curl H = j + PTE Ampére—Maxwell law. (II.2.7) 
€ 


The vector j is the electric current. The term 3E /ðt, called the ‘displace- 
ment current’, was introduced by Maxwell and led to the consideration of 
the electric and magnetic fields as the splitting, in an observer-dependent 
fashion, of one entity called the electromagnetic field, into these electric 
and magnetic fields. The set of equations (2.4)—(2.7), still valid today, 
are the shining success of nineteenth-century theoretical physics. 

The various laws written above were interpreted before Einstein with 
t the absolute time and E? the absolute space determined by the 
mysterious medium of the aether. The Maxwell equations imply that 
electric and magnetic fields—and also light, which is an electromag- 
netic phenomenon—propagate in vacuum with the constant velocity c, 
independent of the time ¢ and the location in this absolute space.? The 
number c can be made equal to 1 by appropriate choice of space and 
time units. 

The problem, for Newton’s mechanics, is that the Maxwell equations 
are not invariant under the Galileo group. 

It was the mathematical genius of Lorentz and Poincaré to discover the 
group that leaves invariant the Maxwell equations, but neither of them 
discarded Newton’s absolute time, nor the aether filling the absolute 
space. 
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6Perhaps physics is coming back to 
the idea of an absolute spacetime de- 
termined by its content, everywhere- 
present pairs of virtual particles, but 
perhaps spacetime is not a differenti- 
able manifold, either below the Planck 
length or at cosmological scales. 


“See Section II.5 in Chapter II. 


8It was already known that the speed of 
light is finite, and it had been measured 
by Romer in 1675 from observations of 
the eclipses of Jupiter’s Moons. 

°In the absence of electric current and 
charge, this set of equations implies 
that E and H satisfy the wave equation 
in the Minkowski metric 


1 0?H 


a Ar ae 
c2 Ot? + 
1 OPE 
EEEN =i, 
c? Ot? t 
> 
= i2 
î=1,2,,3 (at) 
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II.3 The Lorentz and Poincaré groups 
The (n + 1)-dimensional Lorentz group is the group of linear maps of 
Rett, 
X“ = LS X”Ă, 
that preserves the quadratic form 


n 
EPEY 
i=1 
that is, the elements L%, of matrices L that represent the Lorentz group 
on R”! are such that for each set {X°} € R”+!, it holds that 


- (L9 XP + SO (LEX)? = (KX)? + X (XP. (3.1) 


This identity with X” = 1, X* = 0 implies that 


(Lp)? =1+ X (P 21. (11.3.2) 
qalan 


The orthochronous Lorentz group is the subgroup that preserves 
the orientation of R defined by the element L}, i.e. such that 


Lp. > 0; (11.3.3) 


The elements with L}, < 0 reverse the orientation of R. They do not 
constitute a group. 

Equation (3.1) implies that the determinant of L is equal to +1 or 
—1. Proper Lorentz transformations are those that preserve the 
spacetime orientation; i.e. they are such that 


Det(L) = 1. (11.3.4) 


Particular Lorentz transformations are the space rotations 
L9, = Li, = 0, > xP = X P. 


Standard geometrical considerations show that every proper Lorentz 
transformation can be written 


L= RıLsRo, (11.3.5) 


where Rı and Rə are space rotations and Lg is a so-called special Lor- 
entz transformation, acting only in the time and one space directions, 
namely such that (3.1) reduce to 


(Iy = (Lar =i, (11.3.6) 
GP = (Lay =, (11.3.7) 
LS, by — LLL}, = 0. (11.3.8) 


II.4 Lorentz contraction and dilation 


Exercise I1.3.1 Show that the general solution of these equations is, 
for an orthochronous transformation, 


L, = L}, = cosh, L} = LÀ, = sinh g. (11.3.9) 


The (n + 1)-Poincaré group is the group of isometries of a flat 
Lorentzian (n + 1)-manifold, the Minkowski spacetime M”*?. It is the 
semidirect product of the translation group R"+! of R"+! and the 
Lorentz group L"*?. 

See Problem II.1 for the statement and proof of the invariance of the 
Maxwell equations written for a pair of vectors FE, H. 


II.4 Lorentz contraction and dilation 


Before the new physics introduced by Einstein—that is, redefinition 
of time and space as observable reality instead of abstract a priori 
concepts—the results obtained by Lorentz and Poincaré led to contro- 
versial studies on the dynamics of charged bodies. We give some of their 
results, obtained by considering a special Poincaré transformation. We 
set x° = t, €? = r, and V := tanh y, and hence sinh y = V/V/1 — V2, and 
cosh yp = 1//1—V? (note that V < 1). We obtain the transformation 
law 


tr zie V(x" = et) i 


z gl 7 git = en ee V(t = T’) 
Ve 7 


V1 — V2 , 
(11.4.1) 


t-T= 


where (x!,t) and (€',7) are the coordinates of two points in the (xt, t) 
plane in an inertial system, and the primed quantities are the coordinates 
of these points in another inertial system in uniform translation along 
the x! axis with respect to the first one. With classical interpretation 
of time and space, the relative velocity of this translation is dx’! /dt for 
fixed a!, while the relative velocity of the unprimed frame with respect 
to the primed one is dz! /dt' for fixed x’. These velocities resulting from 
the above formulas are found to be 


dx! V dz" -V 


VG, |xz't=const = -F d — xzl=const = F. 11.4.2 
ai | 1 t 1_v2 an T” t V1- V2 ( ) 


e Consider two events simultaneous in the primed frame: t = 7’. The 
formulas (4.2) give that 
ri 1 
1 i 2—6 
rT -E =, 11.4.3 
and hence the spatial distance observed in the primed frame is 
smaller than that observed in the unprimed one. This is the Lorentz 
contraction. 
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101t was first introduced as a tech- 
nical tool (in the Euclideanized version 
with «4 = iz?) by Poincaré in 1905. 
Later, in 1908, Minkowski realized the 
deeper mathematical—physical import- 
ance of the four-dimensional structures 
7 and F. 


1lAs in Chapter I, we have adopted 
the MTW convention, although the 
signature (+ —— —), which is more con- 
venient for some problems, is used by 
many authors. 


e Consider two events with the same spatial location in the primed 
frame: z’! = £1, Then 
t-7r>t—r'. (11.4.4) 
This is the Lorentz dilation. 


The Lorentz contraction and dilation are not intrinsic phenomena. 
They are relative to the observers, linked with their reference frames, as 
is obvious from the fact that they are reversed by exchanging the roles 
of these frames when defining simultaneity or spatial coincidence. 


II.5 Electromagnetic field and Maxwell 
equations in Minkowski 
spacetime M+ 

Recall that the Poincaré—Minkowski spacetime!? M* is the (3 + 1)- 


dimensional manifold R* endowed with a Lorentzian, flat metric that 
reads,!! in so-called inertial coordinates, 


n= —(dx°)? + X` (di). (11.5.1) 
i=1,2,3 
We define the electromagnetic 2-form F on M4, 
1 
= 5 Fapdz* ^ dx”, (II.5.2) 


by its components in inertial coordinates deduced from the electric and 
magnetic vectors E and H previously considered on Æ’. We set 


Fio = Ei, F23 = Hi, F31 = H2, Fiz = H3. 


The electric current is defined as the vector J* on M4 with components 
J =i PS 


Theorem II.5.1 Giving to the velocity of light its geometric value 
c= 1, the Maxwell equations (2.4)-(2.7) can be written as a pair of equa- 
tions for the electromagnetic 2-form F on Minkowski spacetime M+. In 
arbitrary coordinates, the equations are, with V the covariant derivative 
in the Minkowski metric in these coordinates, 


dF =0, ie. VaF ay + VyFap + VeF ya = 0, (11.5.3) 


and 


V-PeJd, ie Vaf t aw, (11.5.4) 


where indices are raised with the Minkowski metric. 
Equation (5.4) implies the conservation of the electric current J: 


VaJ* =0. (IL.5.5) 
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Proof. It is straightforward to check in inertial coordinates that equa- 
tions (2.4) and (2.5) are equivalent!” to dF = 0 and that equations (2.6) 
and (2.7) are equivalent to V - F = J. This equation implies V - J = 0 
(see Chapter I), which could have been deduced directly from equations 
(2.4) and (2.5) in inertial coordinates. Since d and V- are geometric op- 
erators, the Maxwell equations for F considered as an exterior 2-form 
are valid in any frame of M4. 

Since the operator d does not depend on the spacetime metric, and 
V - F is invariant under the group leaving the metric invariant, the 
Maxwell equations are invariant under the isometry group of Minkowski 
spacetime. a 


Conversely, given on spacetime an exterior 2-form, we can define for 
a given observer a corresponding electric field as follows. We consider a 
local coordinate system for which the observer is at rest, i.e. the observer 
describes a time line with unit tangent vector u with u“ = dg. Then, 
the observer’s electric vector field, orthogonal to u, is EÊ = F°“ug, i.e., 
in his proper frame,'? E’ = F”. 

The magnetic 2-form of an observer is the trace in his proper space 
of the electromagnetic 2-form. A magnetic space vector field is defined 
only if this space is three-dimensional. In this case the magnetic field 
is the adjoint of the magnetic 2-form, given by the contracted product 
with the volume form of the three-dimensional Euclidean metric: 


H” := ant Ry 
Note that H so defined (for n = 3) is a vector on the space manifold, 
but it is not a spacetime vector. 
Under a change of observer, i.e. of proper frame, the space and time 
components of the electromagnetic field are mixed. This fact has been 
checked over and over in laboratories. 


Exercise I1.5.1 Write transformation of the electric and magnetic 
fields under a change of Lorentzian frame with only non-zero components 
L& with a and a! equal to zero or one. 


Ox” Ax?" 
Ox® Ox? 


Mazwell tensor 


Hint: Use Fag = Fygi = U2 LG Fup: 


The Maxwell tensor 7 is the symmetric 2-tensor given on Minkowski 
spacetime by 


1 
Tag = Fo Fax — gros F” Py. (11.5.6) 


Exercise II.5.2 Show that the Maxwell tensor is traceless in spacetime 
dimension 4. 


The definitions of Æ and H in space dimension 3 show that the 
components Toa Of T in inertial coordinates read as follows, with ¢€,j7 


12Recall that in inertial coordinates on 
M* with c = 1, it holds that FY = F; 


j» 
Fig = —F*®, and Vo = 0/dx%. 


13The proper frame of an observer is 
the orthonormal frame with timelike 
vector the unit tangent vector u to 
its trajectory. Its proper space is the 
hyperplane orthogonal to u. 
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totally antisymmetric and €493 = 1: 
1 
Too = T +H?), Toi = —eip BI H'. (11.5.7) 


The component Too = T°° is the energy density of the electromagnetic 
field, and the 7T% = —To;, i = 1,2,3, are the components of the Poynting 
energy flux vector P, given in vector product notation on R? by 


P=ENH. (11.5.8) 
Lemma II.5.1 Modulo the Maxwell equations, it holds that 
VaT”? = J*Fs. (11.5.9) 


Proof. Straightforward calculation gives 
a ar ar 1 Au 
VaT = (Va F) Foa + Pw V a Foa — 5 VeF yp: 
Hence, changing names of indices and using antisymmetries, 
1 
VaTZ = (Va F>) Foa + 5 FO (VaFoa +V Fog t+ VeFyo). (I1.5.10) 


The Maxwell equations (5.3) and (5.5) imply the result. a 


The vector JF gd is called the Lorentz force. 

When the Lorentz force is identically zero, the equations Var®? = 0 
written in inertial coordinates are the usual equations of conservation of 
energy and momentum. 


II.6 Maxwell equations in arbitrary 
dimensions 


The Minkowski spacetime M+! is the manifold R"+! endowed with a 
flat metric that reads in inertial coordinates 


n= (dx)? + X. (da')?. (11.6.1) 


In a Minkowski spacetime of dimension n + 1, it is natural to define an 
electromagnetic field as an exterior 2-form F that satisfies the Maxwell 
equations 


dF=0, V:-F=J, with V-J=0. 


Let u be a unit timelike vector field. The electric field relative to u is 
the vector field u° Fog; it is orthogonal to u. Choose for u the time 
vector 0/Ox° of a system of inertial coordinates. The electric field is 
then a space vector with components FE; := Fio, while the components 
F;j define on each submanifold Mo := { x° = const} an exterior 2-form, 
which can be called the magnetic 2-form. As mentioned previously 
the magnetic vector field is defined only in space dimension 3. 


The definition (5.6) of the Maxwell stress energy tensor, as well as its 
conservation law (5.9), extend to arbitrary dimension. 


II.7 Special Relativity 
II.7.1 


Lorentz and Poincaré had made mathematical studies, but it was Ein- 
stein who made the conceptual jump to discard Newton’s absolute time 
and space as physical realities.‘ Einstein, in Special Relativity, replaces 
the direct product E’ x R by the Minkowski spacetime M*. An event is 
now a point in spacetime. Its history is a timelike trajectory in M+. An 
Einsteinian revolution was to discover that the parameter t appearing 
in the Minkowski metric has no physical meaning: the quantity meas- 
urable by well-defined clocks (mechanical, atomic, or biological) is the 
length of their timelike trajectories in spacetime; it is called the proper 
time. More precisely, if C : t > C(t), t € [t1, t2] is a future causal curve 
parametrized by t joining two points of M4, then the proper time along 
that curve is the parameter-independent quantity 


mE C 
Lat? at 


Relativity postulates the existence of universal clocks defined by spe- 
cific physical phenomenon that measure the proper time. Such clocks 
are nowadays obtained by using the frequency of radiation emitted by 
specific atomic transitions, predicted by quantum theory to have a con- 
stant universal value. The actually adopted standard clock is the caesium 
atom, which exhibits a particularly stable (of the order of 10716) micro- 
wave transition between two particular energy levels. The second is now 
defined through the time measured by the caesium clock.!° 

Einstein’s concept of proper time has been proved valid in all 
experiments performed to test it. 

In a Lorentzian manifold, the length of timelike curves joining two 
points has a local maximum for a timelike geodesic. Geodesics of 
Minkowski spacetime are represented by straight lines in inertial co- 
ordinates. In particular, the line where only the parameter t of inertial 
coordinates varies is a timelike geodesic. Therefore, the time measured 
by an observer at rest in some inertial coordinate system is greater than 
the time measured by a traveller which does not follow such a straight 
line between their separation and their reunion. This so-called ‘twin 
paradox’ (see Fig. II.1)!® has been long verified for elementary par- 
ticles in modern accelerators. The reality of proper time has also been 
checked in 1971 in an airplane flight of a caesium clock which was late 
with respect to a similar clock that remained on ground.!” In long space 
travels it could be verified with the human biological clock. 

The case of spatial distances is more delicate to treat because of the 
lack of an absolute notion of simultaneity. 


Proper time 


(11.7.1) 


I.7 Special Relativity 43 


14See Damour (2013a). 


SIN 


ds? = -dP + dx? 


T 
n= [Vi-Pa<r 
0 


Fig. II.1 The twin paradox. 


15t is likely, however, that in the fu- 
ture, the caesium clock will be replaced 
as a standard by an optical clock. 


16There is actually no paradox, be- 
cause the two twins do not have the 
same history: one describes a geodesic 
in Minkowski spacetime, and the other 
does not—in fact, he has to use a mo- 
tor to follow his trajectory. This phys- 
ical effect is not to be confused with 
the apparent time dilation, which is a 
reciprocal effect. 


17Tt is pleasant to know that the trav- 
eller ages less than the one who stays 
put. 
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18This definition comes from associat- 
ing with an observer at some point of 
spacetime the inertial system in which 
he is momentarily at rest. 


II.7.2 Proper frame and relative velocities 


A velocity, even in Galileo-Newton mechanics, is always defined with re- 
spect to some observer. The problem is more complex in Relativity, since 
there is neither absolute time nor absolute simultaneity. In Relativity, 
an observer is defined by its world line, a timelike curve. The norm (in 
the Minkowski metric) of its timelike tangent vector is dependent on the 
choice of the parameter on the curve; this norm has no physical meaning 
if the parameter is not specified. We label as a ‘unit tangent vector’ u 
this tangent vector normalized by 


n(u,u) = —1. (II.7.2) 


We call the proper frame of the observer (not necessarily following a 
geodesic) at some point of spacetime an orthonormal Lorentzian frame 
with u as its timelike axis. Consider an object following a causal line 
with tangent v at the considered point. If in the proper frame the vector 
v has time component v? and space components vê, we say!® that the 
object has velocity V with respect to this observer, where V is a space 
vector, orthogonal to u, with components 


i U 
a 3: (1.7.3) 
Since v is causal, it holds that 


V|= , (11.7.4) 


with |V| = 1, the speed of light, if and only if v is a null vector. 

In particular, consider two observers at a point of spacetime with unit 
velocities u and u’. Choose their proper frames such that they are linked 
by a special Lorentz transformation, i.e. e2 = eh, e3 = ey. The velocity 
of the primed observer with respect to the unprimed observer is then 
along the axis e and is given by, with 4% the components of u’ in the 
unprimed frame, 


u 1 La 
= = 0 IL7. 
V= 75 (11.7.5) 


since the components of u’ in the primed frame are u’! = 0, u = 1. 
Addition of velocities 


As foreseen from the overturning of the notion of absolute time and 
simultaneity, relativistic relative velocities do not add simply as in 
Newtonian mechanics. 

Consider two systems of inertial coordinates (t, £t, 2%) and (t’, 2’, 7“). 
They define at each point of spacetime two Lorentz frames, linked by a 
special Lorentz transformation, in relative motion with velocity V. The 
velocity of a given point particle is dæ! /dt with respect to the unprimed 


frame and dx’'/dt' with respect to the primed frame. The formula (4.1) 
implies that 


dx! E dx + Vdt' 


di dt’ 4 Vdr (17) 
and, therefore, setting dx! /dt = U, dx /dt! = U’, 
U'+V 


This coincides in a first approximation with the classical formula for 
addition of velocities when U’V is small with respect to 1, the velocity 
of light. 

The formula (7.7) implies that U = 1 when U’ = 1: the speed of 
light is independent of the relative velocity V of different observers. This 
property, very surprising in the framework of Galilean kinematics, but 
already implicit in the Maxwell equations, was found experimentally by 
Michelson!’ and verified more accurately by Michelson and Morley. 
They compared the speed of light in the direction of the orbital velocity 
of the Earth and in a transverse direction. To the surprise of the scientific 
world, they found the same value in both cases to an accuracy higher 
than possible experimental error. Since then, it has been verified with 
greater and greater accuracy, in a medium that does not interfere with 
this velocity, the vacuum. The most recent experiments give, with a 107° 
accuracy, 299 792.458 km/s. The Michelson—Morley experiment was in 
part?° a source of inspiration to Einstein to discover Special Relativity. 

Since the speed of light in vacuum is a universal constant, it can 
be used to define the standards of length and time from one another. 
Physics has shown that time can be measured more accurately, with a 
caesium clock, than length, for which the previous standard was related 
to the wavelength of a krypton transition.?! It has been decided by 
scientific authorities that the metre is now officially defined to be the 
distance covered by light in (299 792 458)~! seconds. The speed of light 
in vacuum is therefore fixed to be a universal constant. Unless otherwise 
specified, we follow the mathematical usage of choosing units of length 
and time such that this constant is equal?” to 1 (its ‘geometric’ value). Of 
course, in making comparison with observations or experiments, other 
units may be more appropriate. 


II.8 Some physical comments 


Like all mathematical models, Special Relativity aims at providing as 
accurate as possible a picture of physical reality. No mathematical model 
can replace reality, but the first problem is to be able to compare the 
results given by equations with observed facts. In Special Relativity, 
one should physically identify spacetime inertial reference frames for 
Minkowski spacetime. The choice posed a puzzle for Einstein, since 
Minkowski spacetime considered as a global model is empty. 
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19See footnote 8. 


20Tt seems that Einstein was more in- 
spired by Faraday’s law of induction 
than by the Michelson—Morley experi- 
ment. 


21The caesium clock defines the sec- 
ond as 9192631770 periods of the 
radiation from a specific caesium-133 
atomic transition. Optical clocks de- 
fined by electromagnetic radiation are 
even more precise. See, for instance, 
Salomon (2013). 


?2Tf the second is physically defined (for 
instance through the caesium clock), 
then unit of length for which the vel- 
ocity of light is 1 is the distance covered 
by light in 1 second, that is, 299 792 458 
metres. 


46 Special relativity 


23The rest mass is a number depending 
only on the nature of the particle. 


?4Recall our choice of signature 
Seb ee 


II.9 Dynamics of a pointlike mass 


II.9.1 Newtonian law 
+23 


The Newtonian equation of motion (2.5) of a particle with rest“? mass 
mo a constant, subjected to a force f, can be written 
d(mov) 
= II.9.1 
w = 7, (119.1) 


with v its velocity with respect to a Galilean inertial frame and t the 
Newtonian absolute time. When f = 0, the particle is in uniform recti- 
linear motion in all Galilean inertial frames. In a non-Galilean frame, a 
term, called the inertial force, must be added to f. 

From (9.1), we obtain the Newtonian energy equation 


mov?) = f-v, (11.9.2) 


where the raised dot denotes the Euclidean scalar product. 

If we want to write Newton’s law (9.1) in a non-Galilean frame, we 
must add to f the so-called inertial forces due to the motion of the 
considered frame with respect to a Galilean one. 


II.9.2 Relativistic law 


The trajectory of a pointlike massive particle in Minkowski spacetime is 
a timelike curve. Since there is no absolute time to define its velocity, 
we consider its unit velocity u, the tangent vector to its trajectory 
parametrized by the proper time (the Minkowskian arc length s). The 
components of u are, in arbitrary coordinates,?+ 


u? = satisfing uu, =—l. (11.9.3) 
8 


The acceleration of the particle is the derivative of u in the dir- 
ection of itself, u~V,u’. The time-dependent, coordinate-dependent, 


Newtonian equations in space, (9.1) and (9.2), are replaced in relativistic 
dynamics by a spacetime, coordinate-independent, equation 


u*Va(mou*) = FF, (11.9.4) 


where F is now a spacetime vector. Using ugV aul = 0, a consequence 
of ugu? = constant, (9.4) gives 


u*damo = ugF®. (11.9.5) 
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This equation holds with mp a constant? on the trajectory if and only 
if uaF? = 0, ie. if F is orthogonal (in the Minkowski metric) to the 
trajectory,” which we shall assume in what follows. 


Remark II.9.1 Equation (9.4) reads (see Chapter I) in an arbitrary 
frame 


Pipe E re Xr 
mo ($ Tyr ale E (II.9.6) 


In a Minkowskian inertial reference frame, the connection coefficients 
vanish, and (9.6) with index i reduce to an analogue of Newton’s equation 
in a Galilean frame. This remark leads to interpretation of the term 
involving T as a kind of inertial force. 


II.9.3 Newtonian approximation of the relativistic 
equation 
We denote by V as in Section II.7.2 the relative velocity of a mas- 


sive particle with respect to some Minkowskian inertial coordinates 2’, 
x5? = t, where its unit velocity is u® = dx*/ds; that is, we set 


Ve =¢ = —; (11.9.7) 


te =s = gada (11.9.8) 


ds y1- VP ds ~ y1- VE 


These relations imply that the relativistic equations (9.4) can be written 
in the inertial frame as 


ya o moV’ -Fi a? o mo — F° 
dae ( V1- WP) ° Oe (VI-V) ` 


On the trajectory, it holds that 


and hence the above equations read 


d moV’ ž d mo 

eee VV, SS eer VI SIV Fe. 

rae VWI, aaa PV " | 
II.9.9 


The formulas with indices i look like the Newtonian ones if we replace 
mo by 


Far and fi= F*/1—|V/-. (11.9.10) 


25The rest mass of a molecule is modi- 
fied by chemical reactions. The rest 
mass of an atom is modified by its 
absorption or emission of photons. 


26This is the case for the electromag- 
netic Lorentz force. 
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27It is possible that all fundamen- 
tal elementary particles have zero rest 
mass and that the positive rest mass of 
the particles that appear to us as elem- 
entary is only an interaction energy. 


28 After many years during which the 
rest mass of the neutrino was believed 
to be zero, it has now been established 
that this rest mass is very small, but 
non-zero. 


For small velocities, i.e. for |U| small with respect to 1, we have 
1 —[UP=1 — 4|U/?. Then, 


1 
m=mo (1 + slvr) (11.9.11) 


Therefore, in this approximation, the mass m appearing in the relativ- 
istic equation is the sum of the rest mass of the particle and its kinetic 
energy. 


Remark II.9.2 The force ft is replaced in the new notation by 
'\/1—|V|?. The condition for the constancy of mo reads 
: V; F as F° 
ug F? zuf euf = =0 


Vvi= |V}? 


and hence is satisfied by the condition 


po “F 


ey 


where V; ft is in Newtonian mechanics the power of the force f. 


II.9.4 Equivalence of mass and energy 


The addition of the ‘kinetic energy’ and the energy of applied forces to 
the rest mass mo in the relativistic equation led Einstein to his famous 
postulate of the equivalence of mass and energy,?’ with a conversion 
factor of the order of c?, where the speed of light c is taken to be 1 in 
the expression used for the Minkowski metric (geometrized units). The 
equivalence of mass and energy has been verified in nuclear reactions 
in a spectacular fashion. The energy can be created by the fission of 
a uranium atom induced by a collision with a neutron. The sum of 
the rest masses of the incoming particles is greater than that of the 
post-fission particles, the difference being seen as kinetic (or radiated) 
energy. Nuclear fusion of particles into a single particle with rest mass 
smaller than the sum of the rest masses of the incoming particles also 
produces energy in accord with Æ = mc?. Complex fusion reactions 
involving hydrogen and its isotopes deuterium and tritium, leading in 
particular to helium nuclei, is the main source of the heat produced 
by the Sun. Researchers are trying actively to reproduce it on Earth. 
Constant rest masses are assigned to elementary particles, the photon 
(rest mass zero), electron, proton, neutrino,?® etc. The proton has now 
been experimentally found to be composed of quarks and gluons. 


Remark II.9.3 A useful physical quantity is the energy-momentum 
P, a vector tangent to the trajectory and defined for massive particles by 


P% =mu%, hence PP, =—m?. 


In an arbitrary Lorentzian frame, the component P? is considered as the 
energy of the particle with respect to that frame, while the components 
P! define its momentum. This splitting is frame-dependent. 


11.9.5 Particles with zero rest mass 


An energy-momentum is also defined for particles with zero rest mass. 
Indeed, it was recognized by Einstein in 1905 that light can be de- 
scribed both by waves and by particles, the latter being called photons. 
A particle with zero rest mass moves on a null geodesic, and the energy— 
momentum of a photon is a null vector P tangent to this geodesic, 
satisfying the equation 


(PP — S7( Pi)? = 0. 
i 
Its energy with respect to some Minkowskian reference frame is the 
component P°. 

To determine P, we consider that the wave associated to the con- 
sidered photon is given by a scalar function e’” with hypersurfaces of 
constant y normal to the trajectory of the null particle and hence with 
grad y a null vector proportional to P. The wave has a frequency v in 
some inertial coordinates if Op/Ox° = 27v. 

Einstein introduced in 1905 the postulate that the energy of a photon 
with respect to some Minkowskian reference frame, i.e. the component 
P?, is proportional to the frequency observed in that frame of the as- 
sociated wave. The energy-momentum of a photon in some inertial 
coordinates is, with h Planck’s constant (introduced by Planck in 1900 
with a different meaning), 


h 
P= — grady, P?= hv. 
27 


The relation P® = hv introduced by Einstein to explain the photo- 
electric effect is one of the key discoveries that led to quantum 
mechanics. 


II.10 Continuous matter 


Our never-ending improvement in the exploration of reality has made us 
see that matter is discontinuous at all scales: galaxies in the cosmos, stars 
in galaxies, molecules in stars, atoms in molecules, ‘elementary’ particles 
in atoms, quarks in protons, strings, ... But, at some scales, which we call 
‘macroscopic’, we are not interested in the impossible task?’ of following 
the individual motions of constituents. We wish to describe the behav- 
iour of volumes, small at the observed scale, but large at the scale of the 
constituents, which we call particles, of the model that we are studying. 
Particles eventually go in and out of such a volume. If it is possible to 
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29Even without speaking of the intru- 
sion of quantum mechanics! 
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30 Except for null fluids, where 
uia = 0. 


define, pointwise on spacetime, measurable macroscopic quantities that 
characterize the behaviour of matter at the scale of interest, we say that 
we are dealing with continuous matter. 

Real matter is much too complicated to be represented by a single 
model. In fact, all models are only an approximation of a type of mat- 
ter. Roughly speaking, fluids are matter models where, in the absence of 
external forces, the only action on an elementary volume is an interfacial 
action with neighbouring elements that does not prevent them from slip- 
ping against each other. Perfect fluids are those for which the antislipping 
force has minimal action, that is, is orthogonal to the interface. 

In Newtonian mechanics with absolute time and simultaneity, the 
state of a fluid is characterized, if one leaves aside thermodynamical con- 
siderations, by its density function p and its flow vector field v. These are 
absolute time-dependent quantities on Euclidean space E3. They satisfy 
on the one hand the conservation-of-matter equation 


Ou 4) 
DE + 0;(pv") = 0 (II.10.1) 


and on the other hand the equations of motion obtained by passing to 


the pointlike limit of the Newtonian dynamical laws applied to a small 
volume in E3. These equations are 


d Ə , 
th = := Hov’ II.10.2 
+ On 0, a Ji v (1.10.2) 


d(uv') 
dt 


where t’” is the stress tensor, deduced in the case of fluids from the force 
f that the surrounding matter exerts at a point on an element of surface 
with normal n by the linear relation 


F = tnj. (I1.10.3) 


The tensor t depends on the nature of the matter. It can be proved to 
be symmetric when the matter has no intrinsic momentum density. 

In relativistic dynamics, the fundamental macroscopic, spacetime 
quantities characterizing a fluid are the flow vector field u, a time- 
like unit vector,” i.e. uŭua = —1, and the energy density function 
u. A Lorentzian frame with timelike vector u is called a comoving or 
proper frame of the fluid. The function u on spacetime is the time 
component in the proper frame of the energy-momentum vector field 
P= pu. 

If there are no interactions (chemical, nuclear, inelastic shocks, etc.) 
modifying the nature of the underlying particles, we can define for the 
fluid a rest mass density r, proportional to the particle number density if 
all particles have the same rest mass. This scalar function on spacetime 
satisfies a conservation equation that reads in arbitrary coordinates, 


Va(ru®) =0. (1.10.4) 


II.10.1 Case of dust (incoherent matter), 
massive particles 


The dust model of matter is a good approximation to other models, 
owing to the high conversion factor in usual units between rest mass 
and other types of energy. A fluid is called dust if neighbouring volume 
elements exert no action on each other. It is then supposed that the flow 
lines are geodesics, 


u?V ue = 0. (I1.10.5) 


The following lemma will be of fundamental importance in General 
Relativity. 


Lemma IJ.10.1 Equations (10.4) and (10.5) imply that the tensor 
TH = rutuP (1I.10.6) 
satisfies the conservation law 
VaT”? = 0. (II.10.7) 


Proof. From u%ua =constant, it results that uf Vaug = 0. The proof is 
then obtained by computing first ugVal. a 


Remark IT.10.1 In a proper frame, the components of the tensor 
(10.6) are 


TX =r, T” =T! =0, TY =0. (I1.10.8) 


? 


II.10.2 Perfect fluids 


The equations of motion for Newtonian fluids are generalized to relativ- 
istic fluids by introducing a spacetime energy-momentum stress tensor 
T, or stress-energy tensor for short, such that its components T°, T° 
and T”? in a proper frame of the fluid represent energy, energy flux and 
momentum per unit volume?! relative to that frame, while the space 
part of this tensor is, in a first consideration, identified with the stress 
tensor of Newtonian mechanics. 

A relativistic fluid is called a perfect fluid if the components of T in 
a proper frame reduce to 


T° = u, energy density, (II.10.9) 
T” =0, no energy flux (no heat flow), (11.10.10) 
T’°=0, zero momentum. (11.10.11) 


and, with p a scalar function called the pressure and e the Euclidean 
metric, 


TY = pe”. (11.10.12) 
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31We do not give an axiomatic formu- 
lation of these physical notions: phys- 
ics cannot be axiomatized (nor, by 
the way, can mathematics at its very 
foundations). Eventually, energy and 
momentum are defined as mathemat- 
ical objects in elaborate theories. For 
instance, energy appears as the con- 
served quantity associated to a time- 
independent Hamiltonian. 
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32The force acting on an element of the 
boundary of an elementary volume is 
orthogonal to it: there is no antislipping 
force. 


33See, for instance, YCB-OUP2009, 
Chapter III, Section 6.2. 


The formula (10.12) coincides with the stress tensor of a Newtonian 
perfect fluid.3? 


Proposition II.10.1 The stress—energy tensor of a perfect fluid in 
Minkowski spacetime is, in arbitrary coordinates, 


TH = puul + p(n? + uut) (II.10.13) 


Proof. One checks that in the proper frame the above tensor has 
components given by (10.9)—(10.12). oO 


Remark IT.10.2 In the usual pressure and mass units, p is of the 
order of c~? with respect to u; hence, at ordinary scales, p is very small 
compared with p. 


The proper frame varies from point to point. It is not in general the 
natural frame of an inertial coordinate system. It is therefore not legit- 
imate to write the usual equations for separate conservation of energy 
and momentum by using ordinary partial derivatives and the compo- 
nents (10.9)—(10.12) of the stress—energy tensor in a proper frame of the 
fluid. The dynamical equations of a perfect fluid are postulated to be 
spacetime tensorial equations that read, in arbitrary coordinates, 


Vat? =0. (11.10.14) 


These equations can be written in an arbitrary Lorentzian metric. We 
will return to them after introducing General Relativity. 


II.10.3 Yang—Mills fields 


The electromagnetic field, and the basics of Special Relativity have been 
treated in Sections II.5-I1.7. The definition of the electromagnetic po- 
tential has been omitted so far. It is a locally defined 1-form A such 
that 


dA =F. 


Its existence, non-unique, results (Poincaré lemma) from the Maxwell 
equation dF = 0.33 The non-uniqueness of the electromagnetic potential 
A, the local character of its definition on a multiply connected mani- 
fold, and the physical properties of spinor fields in the presence of an 
electromagnetic field have led to the interpretation of A as the repre- 
sentative on spacetime of a U (1) connection on its tangent bundle, the 
electromagnetic field being the curvature of this connection. 

After the discovery of other fundamental interactions, strong and 
weak, it was found by Yang and Mills that these interactions could also 
be mathematically modelled by curvatures of connections on the tan- 
gent bundle of spacetime, but with higher-dimensional and non-Abelian 
groups. The electromagnetic and weak interactions have been unified by 
Weinberg and Salam as the curvature of an SU (2) x U(1) connection, 
with the addition of a scalar field, the Higgs field, made necessary by 
the physical fact of the distinction (called symmetry breaking) of these 


interactions at lower energy scales. The strong interactions have now 
been identified as a Yang-Mills field, namely the curvature of a SU(3) 
connection. These connections together with associate spinor fields con- 
stitute what is now called the standard model.?* We recall briefly? the 
descriptions of a classical (non-quantized) Yang-Mills field, the Yang- 
Mills equations and the conserved stress-energy tensor, which can be 
taken as a source®® for the Einstein equations. 

Let G be a Lie group and G its Lie algebra. A representative on space- 
time of a G-connection is a locally defined 1-form with values in G; it is 
called a Yang-Mills potential. The curvature of this connection is rep- 
resented by a G-valued 2-form, called a Yang-Mills field, given in terms 
of A by (with [,] the Lie bracket in G) 


F=dA+[A,A], ie. Fog = OAg—OgAq+[Aa, Ap]. (11.10.15) 
The 2-form F satisfies the identity 
dF=0, ie Valey + V~Fap + VeFya = 0, (11.10.16) 


where d is the gauge-covariant exterior differential and V is the metric- 
and gauge-covariant derivative: 


Val ey = Va Foy + [Aa Fay]; (11.10.17) 


where V is the covariant derivative associated with the spacetime met- 
ric. The Yang-Mills equations, which can be written in an arbitrary 
Lorentzian metric, consist of the identity (10.16) and the following 
equations that generalize the second set of Maxwell equations: 


Va FÊ = VoF + [Aa, F°] = 0. (11.10.18) 


Note that (except for Abelian groups G) the Yang-Mills potential does 
not disappear from these equations. 

The Yang-Mills stress-energy tensor, a generalization of the Maxwell 
tensor, is an ordinary tensor given by 


1 
Top := Fo + Fp, — geek é Fay. (11.10.19) 


where a raised dot denotes the scalar product with respect to the Killing 
form?’ of G. It is divergence-free if the Yang-Mills equations are satisfied: 


VaT? =0. 


Remark II.10.3 On a d-dimensional spacetime, the Einstein equa- 
tions with Yang-Mills source are equivalent to 


il 
Rag = Pap = Fa - Foa — an seer Fy. (11.10.20) 


The Yang-Mills stress—energy tensor has zero trace in four-dimensional 
spacetimes, as the Maxwell tensor. In this case, pag = Tap. 
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34These Yang-Mills and spinor fields 
appear in physics, up to now, only as 
quantum fields. 


35See, for instance, CB-DMI Vbis 
Problem 1. 


36Though, at a macroscopic scale, the 
Yang-Mills fields are not directly ob- 
served. 


37See, for instance, CB-DMI III D 6. 
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II.11 Problems 


II.11.1 Lorentz transformation of the Maxwell 
equations 


The Maxwell equations on R x R in non-inductive media read, with 
c= 1 the velocity of light, as follows: 
1. The evolution equations are 
ðE! OH? ƏR? OH! OE? OE? 
= J = ; (1.11.1) 
Ot Ox? Ox Ot Ox? or? 
and equations obtained by circular permutation of the indices 1, 2, 3. 


2. The constraint equations are 


OE! OE? ðE? ƏH! OH? ƏH? 

H =q, = 0. 1.11.2 
Ox? = ans! Ox! + Ox? T Ox? ( ) 
The separation into evolution and constraints is coordinate- 

dependent, as is the splitting of the electromagnetic field in electric 


and magnetic parts. Consider the Lorentz transformation 


t V Al v1 Vt 
ao. geo. e wae Tits 
vli- V? vi- V? 
and determine F', H',j',q' depending on E, H, j,q, V and satisfying 
the Maxwell equations. 


t 


IJ.11.2 The relativistic Doppler—Fizeau effect 


It has been well known since Doppler (in 1842) and experimentally veri- 
fied with great accuracy (by Fizeau in 1848) that light appears reddened 
when the source recedes from the observer, with the opposite occuring 
when the source approaches the observer. This fact is easily explained 
by the longer time between two successive pulses needed to reach the 
observer when the source recedes from him (or her). The classical for- 
mula results trivially from addition of velocities in the Newton—Galileo 
kinematics. Give an expression for the relativistic Doppler—Fizeau effect 
using the addition-of-velocities formula in Special Relativity. 


General Relativity 


II.1 Introduction 


Besides his desire to reconcile Special Relativity, valid when gravita- 
tional effects are negligible, and Newton’s law of gravitation, Einstein 
was led to his theory by physical facts and new ideas. We will not try in 
this book to follow Einstein’s long path (1907-1915) of discovery of Gen- 
eral Relativity and the Einstein equations. Their mathematics rely on 
Lorentzian differential geometry, now a well-understood subject,! with 
which we started this book. 

The physical facts that inspired Einstein’s genius are the principle of 
general covariance and the Galileo-Newton equivalence principle. 


II.2 Principle of general covariance 


The principle of general covariance is an extension of the principle 
called ‘material indifference’ in Newtonian mechanics, which essentially 
says that physical phenomena do not depend intrinsically on the ref- 
erence frame in which we express their laws.? The laws may look very 
different for different observers, and hence in different frames, but it 
should be possible to find frame-independent formulas for them. Tensor 
fields are good candidates as objects for physical laws, being intrinsic 
geometric objects on a manifold. Tensors can be represented by their 
components, specific numbers attached to them by the choice of a par- 
ticular reference frame but with general laws for passage from one frame 
to another. 


II.3 The Galileo-Newton equivalence 
principle 
The fundamental law of Newtonian dynamics, 
F = myy, (11.3.1) 


relates the acceleration y of a test particle? in Galileo-Newton absolute 
spacetime E? x R to the force F acting on it and to its inertial mass 
my. In Newtonian mechanics, this inertial mass is a constant, depending 
only on the nature of the particle. 
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lIn fact, General Relativity has given a 
great impulse to the study of Riemann- 
ian and Lorentzian differential geom- 
etry. 


2A principle that seems trivial, if we be- 
lieve there exists a reality, although it 
becomes debatable in quantum mech- 
anics. 


3Which is considered to be pointlike. 
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4The story, perhaps not history, is that 
Galileo verified this universality of fall 
by dropping objects from the Lean- 
ing Tower of Pisa. This fact has been 
observed repeatedly down the years, 
perhaps since Philiponos Eramnatikos 
in Alexandria around AD 300. Astro- 
nauts on the Moon verified it again by 
dropping at the same time a piece of 
metal and a feather (which on Earth 
would have been slowed down by the 
atmosphere). 


5Traditionnally, one introduces a pas- 
sive and an active gravitational mass— 
we bypass this distinction. By analogy 
with the electric field, we could have 
called the function pg a gravitational 
charge density, but since it will be iden- 
tified with the inertial mass density, a 
variable quantity in Special Relativity, 
the analogy would be misleading. 


6The gravitational acceleration of a 
point particle of inertial mass my; due 
to pq is the gradient of the potential 
U, independent of my. On the Earth’s 
surface, the acceleration due to gravity 
is approximately 9.81 cms~?. 


TNewton himself had some doubts 
about this instantaneous “action at a 
distance”, which contradicts our hu- 
man experience. 


8With proper units of mass relative 
to the units of time and length, one 
can set Gy = 1, which is done by 
many authors. Others prefer, as we do 
in this book because it is more con- 
venient when working with the Einstein 
equations, to choose units such that the 
proportionality factor of the Einstein 
tensor and the stress—energy tensor of 
sources is equal to 1. 


In its primitive, fundamental form, the equivalence principle is the 
expression of the fact that the acceleration y due to gravity of a massive 
body is independent of its mass.“ 

In Newtonian gravitation theory, the acceleration of a test par- 
ticle in a gravitational field depends only on its location in space 
(universality of free fall); it satisfies a differential equation that reads, 
in inertial coordinates, 


oe oU 

d Əri 
The function U, called the gravitational potential, satisfies the 
Poisson equation 


(III.3.2) 


8? 


AU = —4rpG, A:= 5 TIA 


i=1,2,3 


(III.3.3) 


where A is the Laplace operator on Euclidean space E’ while the source 
pa is a, possibly time-dependent, positive scalar function on R? equal 
to the ‘gravitational mass density’ of the sources at time t. We do not 
include in the definition of the potential of pg the Newtonian gravi- 
tational constant Gy. We will introduce Gy later after proving the 
proportionality of inertial mass and so-called gravitational mass. 

Classical results on elliptic equations, in particular on the Laplace 
operator, imply that the Newtonian gravitational potential U is given 
at each instant of time and each point x € E? by the space integral® 
over E? at this time:” 


U(x) = | PCY) By By = dy! dy? dy?. (II1.3.4) 
rs |e — y| 

The equivalence principle, the identity of the gravitational mass and 
the inertial mass up to a choice of units, can now be proved through 
the Newtonian principle of equality of action and reaction, as follows. 
Let the source of gravitation be a pointlike particle P located at the 
origin O with gravitational mass mg; that is, a density pg := mað, 
where ô is the Dirac density at O. This particle generates a potential 
with value U = mg/r at a point x at distance r from O. The attractive 
Newtonian force generated by P on a particle P’ of inertial mass m/, 
located at x has intensity m@m,/r?. We can infer from the principle 
of action and reaction that mem’, = mem, and hence the quotient 
mg/m, is independent of the considered particle. Following the usual 
notation, we denote the constant by Gy, and call Gy the Newtonian 


gravitational constant, and we have 


The value of this universal constant depends on the choice of units 
for time, length and mass. It is approximately, in CGS units (see 
Section III.5), the very small number’ 


Gy =6,67259 x 1078 cm? g7! s7?. 


The Newton equivalence principle (the equality mg = my; up toa 
choice of units of inertial and gravitational masses) has been verified by 
experiments on the motion of pendulums by Galileo and Newton (1610- 
1680), followed by several subsequent accurate experiments by Eötvös 
(1860-1908) and in modern times by Dicke, Braginski and especially 
Aidelberger and his group, comparing with a torsion balance the gravi- 
tational force and the inertial, centrifugal, force due to rotation. There 
have been also accurate tests of this equality from celestial mechanics, 
starting from work of Newton and Laplace, up to modern tests based on 
laser ranging to the Moon. 


II.4 General Relativity 


We have in Chapter II expressed the metric of Minkowski spacetime, 
the arena of the dynamics of Special Relativity, in arbitrary coordin- 
ates. We have seen that in the equations of motion of test particles, the 
inertial forces appear through the Christoffel symbols of the metric.® 
This fact, the Galileo-Newton equivalence principle and his genius in- 
spired Einstein to replace the Minkowski spacetime of Special Relativity 
by a general Lorentzian manifold (hence the name General Relativity). 

The Newtonian gravitational potential had no influence on Newtonian 
absolute spacetime structure. The new revolutionary idea due to Einstein 
is that space and time are not a priori given structures but are united in 
a four-dimensional curved Lorentzian manifold whose metric g is linked 
with the energy content of spacetime. This Lorentzian metric governs 
the spacetime causality structure (see Chapter I). The length of a 
timelike curve measures the intrinsic, called proper, time along that 
curve: in Relativity, Special or General, the basic observable quantity 
is proper time. It is assumed that quantum phenomena, namely vibra- 
tions of atoms, permit its measure, independently of everything else. The 
official clock is at present the caesium atom. 

In General Relativity, as was already the case in Special Relativity, 
there is no intrinsic notion of simultaneity, or, therefore, of spatial dis- 
tances. Velocities are defined mathematically in the tangent space at 
the location of an observer at a point of the spacetime (see Chapter IT). 
Assuming that the speed of light is an absolute constant is physically 
sensible because it is consistent with the experimental observation that 
at any given point, the velocity of light is independent of direction, and 
when measured on Earth with standard of length a solid body, it ap- 
pears to be constant with extremely great accuracy. Taking the velocity 
c of light equal to 1 for any observer is no restriction—it is a definition 
of the unit of length in terms of the unit of time.'° 


III.4.1 Einstein equivalence principles 


Unification of gravitation and inertia is at the root of Einstein General 
Relativity. 
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Inertial forces (for example Coriolis 
forces) appear also in Newtonian mech- 
anics through non-inertial (i.e. non- 
Galilean) frames. 


10 Evaluating astronomical distances in 
a time unit, namely the light-year, and 
saying that this is the time it takes 
light to travel from one object to an- 
other is misleading since the proper 
time is zero along a light ray. The dis- 
tance is not the length of a spacelike 
geodesic joining two points of space- 
time, but is defined in a more subtle 
way and depends on the trajectories 
of the objects in the Lorentzian mani- 
fold that represents the spacetime (see 
Section III.5). 
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11 This is known under the name of the 
‘Anthropic principle’. 


In General Relativity, massive pointlike objects in free fall are as- 
sumed to follow timelike geodesics of the metric, and their equations 
of motion are therefore independent of their mass, reading, in arbitrary 
coordinates, 


oe a dz de 
ds? ' PY ds ds ”’ 


with s the proper time (see Chapter I). The connection IT of the metric 
represents in the coordinates x“ both gravitation and inertia. This is 
often called the weak Einstein equivalence principle. This priciple 
is used in the formulation of equations of motion in a given gravita- 
tional field of massive bodies small enough at the considered scale to 
be taken as pointlike, with mass negligible compared with the masses 
that produce the gravitational field: this is the case for the motion of 
the planet Mercury in the gravitational field of the Sun. The advance of 
the perihelium of its elliptic orbit was the first spectacular confirmation 
of Einstein General Relativity (see Chapter V). 

Light rays are null geodesics, trajectories of particles with zero rest 
mass. Their deviation by gravitation was first verified during the 1919 
solar eclipse. 

The weak Einstein equivalence principle can be shown to be a par- 
ticular case, modulo the Einstein equations (see Chapter IV), of the 
Einstein equivalence principle, which says that in the dynam- 
ical equations previously expressed in Special Relativity, the spacetime 
Minkowski metric 7 must be replaced, if the gravitatonal field is not 
negligible, by the spacetime Lorentzian metric, usually denoted by g. 
One sometimes speaks of the strong Einstein equivalence principle when 
self-gravitational phenomena are important. 

We will treat in Chapter IV the coupling of gravitation and matter. 


III.4.2 Conclusion 


The Einstein equivalence principle has its root in Einstein’s basic idea 
of identifying gravitation with inertia. In particle physics in Minkowski 
spacetime, there appear phenomenological constants that are dimension- 
less, and hence cannot be given a geometric value by choice of units: their 
variation in spacetime would violate the Einstein equivalence principle. 
No such violation has been found up to now. The fine-structure con- 
stant a := e?/hc has been estimated to have a fractional variation of 
no more than order 1077 from analysis of a natural fission reactor phe- 
nomenon that took place in Oklo, Gabon two billion years ago. Another 
dimensionless constant, the ratio 4 = Me/Mp of the rest masses of elec- 
tron and proton, has been estimated with a precision of 107° through 
measurements of absorption lines in astronomical spectra. The value of 
these constants is puzzling. Some physicists have remarked that if the 
constants of nature were not what they are, life as we know it would not 
exist and that this is enough of an explanation for our observations.'! 
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III.5 Constants and units of measurement 


This section recalls definitions, common to Newtonian and Einsteinian 
mechanics, that permit us to express quantitative facts, as required for 
the confrontation of theory and reality. The three familiar items of every- 
day life are distance, mass and time. The fundamental requirement here 
is the definition of units that are both common to a group of people 
and reliable. The system of units used in nearly all countries in sci- 
ence and technology, and in most of them in everyday life, is the CGS 
system.!? The Convention, an assembly of the French Republic, defined 
in 1795 the metre (100 centimetres) as the 1/(4 x 10°) part of the merid- 
ian of the Earth and the gram as the mass of 1 cm? of water. Standards 
of mass and length made of platinum were deposited at an Institute in 
Sèvres—a kilogram (1000 grams) for the mass, and were taken as the 
definitions of the metre and kilogram, replacing the physical quantities 
previously used to define them. The unit of time, the second, was defined 
first as a fraction of the solar day, then as a fraction of a specific year 
and then of a sideral day.!% 

Modern science has defined fundamental universal dimensional! 
constants. Two such constants can, in principle, be used to define two 
units in terms of a third by giving an arbitrary fixed value to these 
constants. If this arbitrary value is 1, it is called the geometric value 
and the units deduced from it are called geometric units. However, one 
of the units must be defined by an experimental device with verifiable 
reproducibility. 

An international committee fixes the definition of the officially chosen 
units, called SI for ‘Systeme International’. Of course, multiples of these 
units can equivalently be used when they lead to values that are easier to 
grasp at the relevant scale, as is the case with the mks (metre, kilograms, 
second) and cgs systems. 

The velocity of light, c, observed experimentally to be constant to very 
great accuracy, is, in metres per second, 


Cm s-1 = 299792 458=3 x 10°. (III.5.1) 
This equality is now taken as fixing a relation between the units of 
length and time. It is the unit of time that is actually chosen’ as basic 
for formulation of experimental results; it is defined at present by the 
period Teaesium Of a particular atomic transition of caesium, predicted to 
be constant by quantum mechanics and observed experimentally with a 
very high accuracy to be the number 


Teaesium = second. (11.5.2) 


9192631770 
The second is now defined by this equality. In the future, it will probably 
be defined with still higher precision by an optical clock. 

The velocity of light will be equal to 1 if we choose as unit of length 
the distance!® covered by light in 1 second. We denote” this distance 


12 centimetre, gram, second. 


13Defined by observation of the stars. 


14A constant is called dimensional if its 
value depends on the units chosen. The 
dimensional equation of a function de- 
pending on positive or negative powers 
PL, pm and pr of length, mass and time 
is 


LPL MPM TT, 


For example, an acceleration has di- 
mensional equation LT~?, a force 
MLT~?. Under a change by a propor- 
tionality factor k of a unit with power p 
in the dimensional equation, the func- 
tion changes by a proportionality factor 
kP. 


15For a while, it was the unit of length 
that was chosen, namely the wave- 
length of a specified ray of the radiation 
emitted by a specific transition of the 
krypton atom, which replaced the def- 
inition of the metre by the standard in 
Sèvres. But it was less precise and less 
convenient to use. Remember that as- 
tronomers have long been using time to 
measure distance, namely light-years. 


16Note that, in Relativity, the defin- 
ition of distance is ambiguous since 
there is no absolute definition of sim- 
ultaneity. 


17 One could call it a light-second. 
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18The dimensions of Gy can be de- 
duced from the fact that Gym/r? is 
an acceleration, and hence has dimen- 
sions LT~2. The dimensions of Gy are 
therefore L?M-!T~-2., 


19This lack of accuracy is due to the 
extreme weakness of the gravitational 
force between bodies of laboratory size. 
Another difficulty appears with masses 
of planetary size: this is that only the 
product of Gyn with their mass ap- 
pears in their equations of motion, and 
these masses must be determined from 
the planetary orbits—which requires 
knowledge of the value of Gy. 


20This official definition does not pre- 
vent mathematicians from setting the 
gravitational constant equal to 1 in the 
equations on which they work. It must 
only be remembered what this means 
when one wants to make numerical 
estimates using various specified units. 


by Ls, and it holds that 
G ger 1. 
The metre is defined, using (5.1), by the formula 
1 metre = (299792 458)~!L,. 


A unit of mass could be defined through the previously chosen units by 
taking the gravitational constant equal to 1; in cgs units, it is observed 
to be,!® 


G N,cgs = 6.67259 x 107°. 


However, the lack of sufficient accuracy! in the knowledge of Gy has 
so far prevented the use of this formula to define a unit of mass in terms 
of the length or time unit. It has recently been decided (July 2011) by 
the BIPM (Bureau International des Poids et Mesures) that Planck’s 
constant h, which is known with a very high accuracy, should be used 
to officially define a unit of mass-energy. It is now h that is a geometric 
quantity.?° 


III.6 Classical fields in General Relativity 


We have written in Chapter II the expressions for the stress—energy 
tensors of the usual and simplest forms of matter and fields in Special 
Relativity. The Einstein equivalence principle leads immediately to their 
expressions in General Relativity. We give them in a general Lorentzian 
metric g on a manifold V of arbitrary dimension n + 1, although the 
physical case on a macroscopic scale is n = 3. 


III.6.1 Perfect fluid 


The tensor representing the energy, stress and momentum of a perfect 
fluid is, in arbitrary coordinates, the following symmetric 2-tensor on V: 


Tap = HUaug + p(gas + Vata) (III.6.1) 
where u is the unit timelike flow vector, i.e. such that 

gapueu? = —1, 
while u and p are scalar functions, respectively the pointwise energy and 
pressure density of the fluid, both of which are non-negative in classical 
situations. 


Exercise III.6.1 Show that in a proper frame of the fluid, i.e. an 
orthonormal frame for the metric g with timelike vector u, the com- 
ponents of the tensor T are respectively the fluid pointwise densities 


Too = w(energy), To; = 0(momentum), Ti; = pôi; (stresses). 
In the case of dust, p = 0. 
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The Einstein equivalence principle implies that a relativistic perfect 
fluid satisfies the conservation equations 


VaT? = 0, 


which have been found in Special Relativity as consequence of the 
classical conservation laws of energy and momentum. 


IlI.6.2 Electromagnetic field 


The electromagnetic field is an exterior 2-form on the manifold V. The 
equivalence principle says that on the Lorentzian manifold (V, gq), 
the electromagnetic field satisfies, in the absence of electric charges, 
the vacuum Maxwell equations 


dF=0, V-F=0. (III.6.2) 


Its stress energy tensor is the Maxwell tensor 
1 
Tap = FoF ax — 7909p F Fau- (III.6.3) 


As foreseen from the equivalence principle, if an electromagnetic field 
satisfies the sourceless Maxwell equations, then its Maxwell tensor is 
divergence-free: 


Var”? =0. (III.6.4) 


The covariant derivatives that appear in these equations of motion are 
a symptom of the influence of gravitation on electromagnetism. 


III.6.3 Charged fluid 


In an electrically charged fluid, there is an electromagnetic field gener- 
ated by the fluid motion, which in turn is influenced by the electromag- 
netic field Fag. In the simplest classical model, the fluid is a perfect fluid 
with unit velocity u, and energy and presure densities u and p, and the 
electric current is the sum of convection and conduction currents: 


J” = qu“ + o“ 


The stress-energy tensor is the sum of the fluid stress—energy tensor and 
the Maxwell tensor: 


Tog = (u + p)Ugtg + PGag + Tag- 


Exercise ITI.6.2 Given the metric g, write the coupled system of 
equations satisfied by u, u, p,q and F. 


Hint: Use the equations 
VaT =0 and Var =ugJ®. 
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?1For details, see Will (2014). 


Ill.7 Gravitation and curvature 


There is no intrinsic splitting between gravity and inertial-type forces. 
In a general spacetime (V,g), it is always possible at a given point 
(see Chapter I) to choose local coordinates such that the Christof- 
fel symbols vanish at that point; gravity and relative acceleration are 
then, at that point, exactly balanced. It is even possible to choose lo- 
cal coordinates such that the Christoffel symbols vanish along a given 
geodesic—astronauts spacecrafts have made popular knowledge the fact 
that in free fall one feels neither acceleration nor gravity; in a small 
enough neighbourhood of a geodesic, the relative accelerations of objects 
in free fall are approximately zero. 

Nevertheless, gravity is a physical reality that cannot be assimilated 
with the old notion of inertia. In a general Lorentzian manifold, there is 
no coordinate system (unless the metric is locally flat) in which all the 
Christoffel symbols vanish in the domain of a chart; a family of geodes- 
ics cannot in general be represented by straight lines. In the presence 
of a non-everywhere-vanishing mass or energy density, gravity manifests 
itself by the non-vanishing of the Riemann curvature tensor Riem(g). 
If Riem(g) does not vanish in an open set U of a spacetime (V, g), 
then the gravitational field in U cannot be identified with an inertial 
field. The existence of a non-zero curvature of the metric is revealed, 
beyond the first approximation, by the relative acceleration of test par- 
ticles, as is predicted by the equation of geodesic deviation (Chapter I, 
Section 7). 


II.8 Observations and experiments 


The following observations have been made and experiments performed 
to confirm the predictions of General Relativity.?! 


III.8.1 The Einstein equivalence principle 


The fact that the acceleration at a point in a gravitational field of bodies 
small at the considered scale is independent of their nature is common 
to both Newton’s and Einstein’s theories of gravitation. It has been 
verified with an accuracy of the order of 10713 in several experiments— 
in particular in the laboratory with atoms of titanium and beryllium 
and a very sophisticated apparatus based on a torsion balance, and also 
at an astronomical scale by refined analyses of results of lunar laser 
ranging (LLR) to determine precisely the orbit of the Moon together 
with sophisticated approximation methods to show the compatibility 
of this orbit with the equivalence principle applied to the system Sun- 
Earth—Moon. 


II.8 


III.8.2 Deviation of light rays 


A consequence of Einstein’s postulate of the equivalence of mass and 
energy is that particles of zero mass should also follow geodesics of the 
spacetime metric. Light rays in particular are null geodesics, and light is 
deflected by a gravitational field.?? This was confirmed by observation 
as early as 1919 by Eddington and Dyson during a total solar eclipse 
(though not with very high precision, because of large experimental er- 
rors). A solar eclipse makes it possible for a photograph of the sky to be 
taken in which stars are visible, so that their positions can be compared 
with those in a photograph taken in the absence of the Sun. Greater 
precision is now obtained for the deflection of radio waves, which does 
not need an eclipse to be observed. 

A spectacular verification of the deflection of light rays by a gravita- 
tional field is provided by the observed phenomena called gravitational 
lensing, in which the light rays from a star or galaxy, i.e. the null 
geodesics issuing from some point in curved spacetime, may intersect 
again such that an observer can see several images simultaneously. Sev- 
eral gravitational lenses have now been observed,?? in particular the 
‘Einstein cross’, obtained by the Hubble telescope, consists of four im- 
ages of the same quasar; in the centre appears a fainter image of the 
galaxy whose gravitational field acts as the lens.?4 


III.8.3 Proper time, gravitational time delay 


As in Special Relativity, the physically measurable quantity is proper 
time, which is the length in the spacetime metric of timelike curves. It 
is again postulated that the period of specific spectral radiation from 
atoms provides universal clocks for the measure of proper time. 

A consequence of the reality of the dependence of this physically meas- 
urable time on the presence of a gravitational field is the observation of 
a redshift in the spectrum of a given atom in a gravitational field, as 
seen by a distant observer for whom this field is weaker. Such shifts add 
to the Doppler effect for bodies in motion. The reality of the Einstein 
effect was verified in the years 1926-1928 with convincing accuracy by 
careful analysis?” of the lines of various elements in the solar spectrum 
by St John, an astronomer working in the Mount Wilson observatory. 
In 1960, the Mossbauer resonant effect (emission of y rays with an 
extremely narrow profile, reabsorbed in crystals with a very sharp res- 
onance) enabled Pound and Rebka to measure the shift of spectral lines 
due to the variation of the gravitational field with height above the 
Earth’s surface in their laboratory. We give below a brief account of the 
theoretical prediction in this case. 

Inspired by the Newtonian approximation (see Section IV.5 in Chap- 
ter IV), we assume the spacetime metric to be, in the laboratory, 


g = —N?(a1)dt? + gii(x')(dx')? + gardx*dz’, a,b=2,3, (III8.1) 
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?2Newtonian gravitational theory also 
predicts light deflection, with light 
treated as small but may be massive 
particles. But the Newtonian light de- 
flection is exactly half of that predicted 
by Einstein and verified by Eddington. 


23 See, for instance, Schneider 
et al. (1992). 


24See for instance Figure 4.16 in Chap- 
ter 4 of Ohanian and Ruffini (2013). 


25See details on pp. 126-130 of Bruhat 
(1931). 
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where x! varies with the height of a point while z, a = 2,3, label the 
position of this point in a horizontal plane. Consider two identical clocks, 
one at rest on the ground, say at x! = 0, the other at rest on the same 
vertical, with height labelled by xt = h. The hypotheses are therefore 
that, in the coordinates x“, the representatives of the world lines of the 
clocks lie in the plane (xt, t); they are the lines where only t varies, given 
respectively by z! = 0 and xz! = h. Let T(0) and T(h) be the periods, 
in proper time, of these clocks. The signals of the beginning and end of 
a period of the clock on the ground are transmitted by light rays to an 
observer handling the other clock. Such light rays obey the differential 
equation 


dt „Jn 
— = ; III.8.2 
dax1 N eee) 


A signal emitted at parameter time t(0) reaches the clock at height h at 
parameter time 


h 
i(h) = | VIn (51) dx” + (0). (III.8.3) 
o N 

The difference in the parameter times between two emitted and two 
received signals is therefore the same. However, this statement is not 
true for the proper times, which are the physically measured quantities. 
We denote by T (0) (respectively T(h)) the period in proper time of the 
clocks. Using the relation between the parameter time and the proper 
time for each of the clocks, namely dT = Ndt, we find that 


to(0) — t1 (0) = N HOT (0) = to(h) — tı (h) = N71(h)T (h). (IIL8.4) 


The corresponding lapse of proper time marked by the clock of this 
observer is 


T(h) = —~T(0). (III.8.5) 


A longer period is observed (redshift) if N(h) > N(0), which is the case 
in our laboratory example where the source of gravitation is the Earth 
with mass m. Then (see the Newtonian approximation) 


m 


N=1- Tr (III.8.6) 
Theory and experimental results have proven to be in excellent agree- 
ment. 

A related effect (which involves the geometry of null geodesics connect- 
ing the timelike world lines of various massive bodies) has been measured 
directly for celestial objects, and called time delay by Shapiro, who 
measured it first, in 1966, by sending radar signals to planets (i.e. in the 
gravitational field of the Sun) and measuring the time elapsed on Earth 
between their emission and their return after reflection. We will give the 
calculations leading to the theoretical prediction in Chapter V, devoted 
to the Schwarzschild spacetime. 


The reality of the proper time defined on spacetime by a Lorentzian 
metric that is not locally flat in the presence of a gravitational field has 
been checked directly by carrying caesium clocks on satellites around the 
Earth and observing that they gain some hundred nanoseconds”° over 
identical clocks that remain on the ground. The experiment has now 
been done with clocks on satellites orbiting Mars and on its surface. 

Finally, it has been found that the accuracy of the GPS position- 
ing system depends upon General Relativistic correction of the physical 
(proper) time, both because the satellites that receive the signals from 
Earth are far above the Earth’s surface and because, for high-precision 
results, it is necessary to take into account that the Earth is not exactly 
spherical, and has plains and mountains that induce variations in the 
gravitational field.?” 


III.8.4 Conclusion 


All experimental results to date?® are in agreement with the predictions 
of Einstein’s General Relativity. 


II.9 Problems 


III.9.1 Newtonian gravitation theory in absolute space 
and time E” x R 


1. By analogy with the case n = 3, the Newtonian potential on the 
Euclidean space E” of a gravitational mass density pg is a scalar 
function U given by 


nje 


U(x) := i PG) ay, with 
R» |x — y| 
Show that U satisfies the elliptic partial differential equation 


8? 
> a (Ox)? U= -(n 


i=1,..., 


2) (area S"—") pg 


2. Show that the Newtonian potential of a pointlike particle with gravi- 
tational mass density mgô, where 6 is the Dirac measure at the origin 
O, is 
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26 After correction for the kinematic 
effect of Special Relativity. An approxi- 
mately drag-free satellite describes a 
geodesic of the Lorentzian (Schwarz- 
schild) metric, but the observer on 
Earth, not in free fall because held by 
the ground, does not. 


27We compute in Problem IV.11.5 in 
Chapter IV the first correction (quad- 
rupole moment) to the Newtonian po- 
tential due to non-sphericity of the 
source. In Newton’s theory of grav- 
ity, the potential has no influence on 
time. We will study in Chapter IV the 
Newtonian approximation of the Ein- 
stein theory of gravity, in which time is 
dependent of the gravitational field. 


?8For a description of basic experi- 
ments, their results and discussions, 
see, for instance, Chapter 4 of Ohanian 
and Ruffini (2013). For up to date re- 
sults, see papers by Clifford Will, for 
instance his recent review (Will, 2014). 
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29See more comments in Chapter V. 


Hint: The Dirac measure at O is by definition such that if f is a 
continuous function at O, then 


a f(y)dd"y = f(0)(area $"~?). 


3. Assuming the laws of Newtonian dynamics and gravitation, 
F=myy, with my the inertial mass and y= gradU, 


show by using Newton’s principle of action and reaction that mg is 
proportional to mz by a fixed constant. 


III.9.2 Mass in length units (case n = 3) 


The value of the Newton gravitational constant Gy is, in cgs units with 
a 107! accuracy, 


Gn,cgs = 6.67259 x 1078 =7 x 107° . 


Compute the unit of mass in the centimetre-gram system (in which the 
second is expressed in terms of the centimetre by taking c = 1) under 
the condition that the Newton gravitational constant is equal to 1. 

Compute in this unit the mass of the Earth, whose mass is about 
6 x 1074 cgs grams in the cgs system. 


Solution 


This uses only the dimensions L?M~!T-? of Gy. Recall that 

1s=3 x 10° cm-s. (II1.9.1) 

Set 1g = ucm-g. The trivial fact that the quotient of the values of 
a quantity in different units is the inverse of the quotient of the units 
gives 
Gy,geom,em = Gn,cest a x 10-7? = 1, 

with 

Gn,cgs =7 x 107°. 
Hence, GN geom = 1 if p = z x 10728. Then Megarth = 6 x 10?” g implies?’ 


42 


MEarth,geom = z7 CM. 
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III.9.3 Planck units 


The dimensional constant that has been measured with the greatest 
accuracy is the Planck constant h. Planck remarked that it is possible 
to define from Gy, h and c units for time, length and mass, called Planck 
units or geometric units, such these constants all take the value 1. 


1. 
2. 


Show that the dimensions of h are ML?/T~?. 

Assume that the three constants c, Gy and h are known in cgs units. 
Denote by Lp = xcm, Mp = ygrams and Tp = zseconds new units 
such that these constants are equal to 1. Such units are called Planck 
units. Write the three equations satisfied by x,y and z. 


Compute the unique solution giving the Planck units. 


Solution 


a 


h = E/v. The dimensions of energy E are ML?/T~? and those of 
frequency v are T71. The result follows immediately. 

The relation between cgs and Planck units and the dimensions of the 
given constants imply 


yz? Zz 


zZ 
CP = Cogs 7y? Gp = Gegs 3 hp = legs a) 


and setting these constants equal to 1 gives 


x x yx 
= = Cogs; yee = Gees, 
which imply trivially 
T £ Z 3 gaml _ 


— — a2 — 3 
z = Cegs, y = Cogs G cgs» y — CogsGcgs; isi 6 T hecgs. 


Therefore, the unique values of the unknowns x,y and z are 


= >; — 2 =l me —1 
T= ~z ~ YF Teog Oe Z = Ueogs, 
Cogs 
which give 
_ hogs — hogsCegs j= Regs Gegs 
3 ? 7 i _ 5 ` 
Cogs Gegs Gegs Cogs 


Of course, the use of Planck units in formulas does not eliminate 
the need for at least one of them to be defined through a physical 
phenomenon, to link equations with reality. 
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1Except for the advance of the peri- 
helion of the orbit of Mercury, which 
behaves approximately like a slowly 
rotating ellipse whose perihelion ad- 
vances 42” per century more than that 
computed using Newton’s law when 
taking into account the gravitational 
fields due to other planets. 


2Finstein was helped in obtaining the 
final result by his friend, the mathem- 
atician Marcel Grossmann. 


The Einstein equations 


IV.1 Introduction 


The previous chapter replaced the Minkowski spacetime (R*,7) of Spe- 
cial Relativity by a general Lorentzian manifold, (V4, g)—hence the 
name ‘General Relativity’. The connection coefficients of the metric g 
replaced both the gravitational and the inertial forces. There are space- 
time frames where at a given point, or even along a given spacetime 
curve, they cancel each other. This statement is in agreement with the 
weak Einstein equivalence principle, which concerns the motion of one 
test particle. However, in contrast to inertial forces in Newtonian theory, 
which disappear in inertial frames (without canceling gravitation), there 
are no reference frames defined on an open set of a spacetime of Gen- 
eral Relativity where all the connection coefficients vanish, except if the 
metric is flat, i.e. free of gravitation. In Einstein gravity, the existence 
of a non-zero gravitational field manifests itself by the curvature of the 
spacetime metric. 

The non-Minkowskian metric g replaces in a sense Newton’s gravi- 
tational potential, which satisfies the Poisson equation, a linear elliptic 
second-order partial differential equation with mass density as a source. 
The metric g should satisfy equations that give in first approximation 
the same results as Newton’s law for the motions of bodies in a weak 
gravitational field and with velocities small with respect to the speed 
of light. Indeed, in these circumstances, Newton’s law had been veri- 
fied with excellent agreement.! Einstein’s geometric foresight looked for 
geometric relations between the metric g of General Relativity and pos- 
sible sources of gravitation. The equivalence of mass and energy that he 
had discovered before, the symmetric 2-tensor character of the metric 
g whose curvature he wanted to link with the gravitation sources, and 
his genius all led Einstein, after various unsuccessful attempts,” to the 
famous Einstein equations that are at present more fundamental than 
ever for theoretical physics at all scales. 

We will write the equations in spacetimes of arbitrary dimension n+1, 
since they are now used by physicists in a wider context than the original 
dimension n = 3. We will specify that n = 3 only when this leads to 
special properties. 


IV.2 


IV.2.1 The Einstein equations in vacuum 


The Einstein equations 


The vanishing in a domain of spacetime of the Riemann curvature ten- 
sor of the metric g implies that this metric is locally flat in this domain 
(see Chapter I), and hence without gravitational effects. It is therefore 
too strong a condition to impose in domains empty of matter or field 
energy sources in a spacetime that is not globally empty. A natural can- 
didate for an equation to impose in vacuo? on the Lorentzian metric g, a 
symmetric 2-tensor, is the vanishing of the symmetric 2-tensor Ricci(g), 
which is related to the Riemann curvature tensor by contraction. The 
tensor Ricci(g) is a second-order partial differential operator on g, like 
the Laplace equation for the Newtonian potential, though it has quite 
different properties, as we will see in the following sections: first, it is 
nonlinear;* second, it presents both elliptic properties like the Newton- 
ian potential and hyperbolic ones like the wave equation, corresponding 
to propagation of the gravitational field with the speed of light. We 
formulate the basic proposition® of Einsteinian gravitation. 


Proposition IV.2.1 In an open set U devoid of energies other than 
gravitation, the Lorentzian metric g of an Einsteinian spacetime (V, g) 
satisfies the following tensorial Einstein equations in vacuum: 


Ricci(g) = 0. (IV.2.1) 

The equations (2.1) are represented in each chart (see Chapter I) 
with domain included in U by the following system of second-order hy- 
perquasilinear® partial differential equations for the components Jag of 
the metric g: 


ð ð 
3 = T 
aP = pra P pra 


A LTA à = 
R tht =T =o, 


a TË (IV.2.2) 


where the re g are the Christoffel symbols of the metric g, given in local 
coordinates (see I.7.13) by 

dJa 

Oxt J` 


These equations are not independent; they satisfy the identities deduced 
in Chapter I from the Bianchi identities: 


Ibu 


ith 
wi Aue 


ere = Hla, u], 


1 /ƏJap 


1 
Va8S =0, Sap := Rap — z9apR, R:= gò™ Rau.  (TV.2.3) 


The tensor with components” Sag is called the Einstein tensor, it is 
the symmetric 2-tensor with geometric expression 


1 
Einstein(g) := Ricci(g) — z9P(9). 
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3The notion of vacuum seems clear to 
common sense: it is a region of space- 
time where there is neither matter nor 
electromagnetic field. Modern physics 
has shown that what we called the vac- 
uum is indeed full of strange things— 
pairs of particles and antiparticles con- 
stantly annihilating each other and lib- 
erating virtual photons or neutrinos, 
which in turn give birth to pairs of par- 
ticles. But these stange phenomena are 
in the realm of quantum field theory 
and do not concern us in this book. 


4Gravitation is also its own source. 


5It is known now that the Einstein 
equations are the only tensorial ones 
(up to addition of a ‘cosmological 
term’, see Section IV.3) that are of sec- 
ond order and quasilinear for the metric 
g in four dimensions. 


6A partial differential equation of order 
m is called quasilinear if it is linear in 
the derivatives of order m. It is called 
hyperquasilinear if, in addition, the co- 
efficients of these derivatives do not 
contain derivatives of order m — 1. 


TDenoted by Gog by many authors. 
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81t is only in the classical dimension n+ 
1 = 4 that d—2 = 2—a fact that should 
not be forgotten when working in other 
dimensions. 


°In particular when there is a density 
of linear momentum, which is natur- 
ally represented by a non-symmetric 
2-tensor. This is the case for electro- 
magnetic fields with induction. 


l0Denoted by « by some authors. 


IV.2.2 Equations with sources 


It took a considerable time for Einstein to settle on the choice of equa- 
tions connecting the metric g with sources. From a physical point of 
view, he was inspired on the one hand by the relativity to observers 
of the splitting beween energy and momentum (see Chapter IT) and by 
the conservation laws for the various stress-energy—momentum tensors 
T found in Special Relativity and on the other hand by the equiva- 
lence principle. From a mathematical point of view Einstein’s choice of 
equations was motivated by the contracted Bianchi identities. 

The geometric equations found by Einstein in 1915, with the help of 
the mathematician Marcel Grossmann, are the very simple ones 


1 
Einstein(g) := Ricci(g) — 3 9R(9) =G; 
i.e. in local coordinates and arbitrary dimension n + 1 > 3, 


1 
Sop = Rap = agaak = GeTag. (IV.2.4) 


Exercise IV.2.1 Show that in a spacetime of dimension n+ 1 > 2, 
the Einstein equations with sources are equivalent to the following:® 


; TA 
Rag=Gepos, With pag = Tap- ap. (IV.2.5) 


The source T is the stress-energy symmetric 2-tensor, supposed to 
represent the pointwise value of all the energies, momenta and stresses 
present in the spacetime. It is a phenomenological tensor whose choice 
is not always easy,’ even in the classical dimension n + 1 = 4 and even 
without taking quantum mechanical considerations into account. 

The factor? Gz is a phenomenological dimensional constant. We will 
explain later, by using the Newtonian approximation, why in spacetime 
dimension n + 1 = 4 physicists take 


Gr = 87Gn, 


with Gy the Newtonian gravitational constant. 

Note that the gravitational constant does not appear in Special 
Relativity—hence, the Einstein equivalence principle does not imply 
that it is a constant. In fact, several physicists (Jordan, Thiry, Dicke, 
Dirac, and others) have conjectured that it is a spacetime scalar function. 
However, no experiment has confirmed this conjecture. 

The geometric units of time and length have been defined in Chap- 
ter III. In these units, the speed of light is equal to 1. 

In this book, as in most mathematical studies, except when making 
numerical estimates, we assume units of mass-energy chosen such that 
Gg takes its geometric value: Gg = 1. 


The contracted Bianchi identities (2.3) show that the Einstein 
equations are compatible only if the tensor T satisfies the equations, 
called conservation laws, 


Vat? = 0. (IV.2.6) 


We have written in Chapter III the equations of motion of the simplest, 
most common energy sources on a general Lorentzian metric of arbi- 
trary dimension by using their expression in Special Relativity and the 
Einstein equivalence principle. 


IV.2.3 Matter sources 


In the case of dust (incoherent matter, also called pure matter), 
the stress—energy tensor is 


Tog = TUats, (IV.2.7) 


with u the unit flow velocity and r the rest-mass density. 
In the case of a perfect fluid, the stress—energy tensor is 


Tap = Uap + p(gas + Uaug), (IV.2.8) 


with u again the unit flow velocity and with u and p respectively the 
energy and the pressure densities, both of which are non-negative in 
classical situations. The equations of motion of the phenomenological 
quantities u,p and u (see Chapter III) imply that T°? satisfies the 
conservation law 


VaT”? =0. (IV.2.9) 


Note that we could have made the converse argument and deduced 
the equations of motion of the sources from the conservation laws im- 
plied by the Einstein equations satisfied by the spacetime metric. In 
Einstein’s theory, in contrast to Newton’s, the equations of motion are 
a consequence of the equations satisfied by the gravitational potential. 


IV.2.4 Field sources 


For an electromagnetic field, the stress—energy tensor is the Maxwell 
tensor 


1 
Top = Fo Fay — gas F Fu. (IV.2.10) 
The Maxwell tensor satisfies the conservation laws 
Var? =0 (IV.2.11) 


if the 2-form F that represents the electromagnetic field satisfies the 
Maxwell equations of General Relativity 


dF=0 and V-F=0. (IV.2.12) 
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There exist other possibilities 
(Gauss—Bonnet equations) in higher 
dimensions, which seem to play a role 
in string theory. 


12Calling it ‘the greatest blunder of my 
life’, because its value was neither logic- 
ally nor experimentally determined. 


13Not to be confused with the Ein- 
stein spaces of geometers, which are 
Riemannian spaces of constant curva- 
ture. 


IV.3 The cosmological constant 


It is known that the only tensorial operator for a metric g on a four- 
dimensional!! spacetime that is second-order and quasilinear is the 
Einstein tensor with the possible addition of a linear term Ag, with A 
an arbitrary constant, called the cosmological constant, which Einstein 
did not include, because it has no a priori physically fixed value. The 
Einstein equations with cosmological constant A read 


1 
Einstein(g) + Ag = Ricci(g) — x9 R(9) +Ag=T. (IV.3.1) 
In local coordinates, these take the form 
1 
Sop } Agag = Rap Jag R t Agag = Top, (IV.3.2) 


2 


where T is the stress-energy tensor considered before, supposed to rep- 
resent the density of all the energies, momenta and stresses of the 
matter and classical field sources. Einstein introduced the cosmological 
constant when looking for a stationary model for the cosmos (see Prob- 
lem IV.11.1) and removed it!” after the interpretation of astronomical 
observations as showing the universe to be expanding. It is again in- 
cluded by cosmologists, although its value is controversial and may have 
been different at different epochs. It is generally considered to be very 
small now but to have been large in the early universe. In fact, the 
tendency now is to put A among the sources, interpreting —Agag as an 
energy-momentum tensor of the vacuum generated by quantum-particle 
processes; the constant A is then replaced by a function of a new field ¢ 
satisfying a wave equation with a non-linear potential. A ‘cosmological’ 
term also appears in supergravity and string theories. 
The equations (3.2) on an n + 1 = d-dimensional spacetime are 
equivalent to the following: 
d TY 


Rag = Pap, With pag = Tap (= z4 TA 5) Jag.  (IV.3.3) 


IV.4 General Einsteinian spacetimes 


A spacetime of General Relativity is a pair (V, g), with V a differentiable 
manifold and g a Lorentzian metric on V, both a priori arbitrary. It is 
called an Einsteinian spacetime!’ if the metric g satisfies on V the 
Einstein equations with source a phenomenological 2-tensor T assumed 
to represent all the non-gravitational energies, momenta and stresses: 


Einstein(g) =T, ie. Sag = Tyg, (IV.4.1) 


V-T=0, ie V,T =0. (IV.4.2) 


The definition of T is clear in the cases mentioned in Section IV.2.3 and 
IV.2.4, but is not so clear and is still controversial for more complicated 


IV.4 


sources, even in the classical cases of dissipative fluids with or without 
heat current and of electromagnetic fields with inductions. Coupling with 
quantum fields raises the deep problem of quantum gravity. New and 
mysterious fields are now being discovered in cosmological studies (see 
Chapter VII). 

Two isometric! 
identical. 

The spacetime is called a vacuum spacetime if T = 0. 


4 spacetimes (V,g) and (V',g') are considered as 


IV.4.1 Regularity 


In an Einsteinian spacetime, the manifold V can be assumed to be C° 
without restricting its generality.'!° We have already said that we assume 
in this book that the metric is as smooth as necessary for any given 
statement to be true. It is clear that if the source T is discontinuous, 
then the metric g cannot be C?. Using the definition of generalized 
derivatives, it may be possible to define the Einstein tensor!® for metrics 
that do not admit second derivatives in the usual sense. 

The largest simple Sobolev Hilbert space for the components of the 
metric g for which the components of the Riemann tensor in four di- 
mensions are almost everywhere defined functions on a coordinate patch 
diffeomorphic to a ball Q of Rt is H3(Q), the space of functions that, 
together with their generalized partial derivatives of order up to 3, are 
square-integrable on 2. 

For physical interpretation, one is led to split, at least locally, space 
and time and consider coordinate patches Q = w x J, with w a ball in R3 
and J an interval of R. The restriction to w of gag is then assumed to be 
in H2(w), and hence uniformly continuous on w, while the restrictions 
of the first derivatives Ogag/Ox* and second derivatives 07. gag /0x*0x" 
belong respectively to Hı (w) and L?(w). The restriction of the Riemann 
tensor is then defined as a function in L?(w). For more comments, see 
Chapter VIII on the Cauchy problem. 


IV.4.2 Boundary conditions 


The Einstein equations are partial differential equations, and so their 
solutions are linked with the data of boundary conditions. We will see 
in Chapter VIII on the Cauchy problem that, even in a vacuum, the 
Einstein equations present both elliptic aspects like the Poisson equation 
of Newtonian gravity and hyperbolic aspects like the wave equation of 
light propagation. We just mention here that in the primary cases the 
manifold V of an Einsteinian spacetime (V, g) is a product M x R, with 
M spacelike and R timelike. 

For cosmological studies, the case of a compact manifold M without 
boundary has been used, although this seems to be in conflict with recent 
observations hinting at a flat universe—but cosmology is a very debated 
subject. 
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14That is, such that there exists a dif- 
feomorphism f of V onto V’ such that 
fxg! =g. 


151¢ is known that any C1 manifold can 
be given a C° structure that induces 
this C! structure. 


16This possibility is limited, however, 
by the nonlinearity of the Riemann 
curvature tensor. 
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17This model is only an approximation, 
like all mathematical models of phys- 
ical situations. The system considered 
as isolated is in fact embedded in a 
curved universe. 


18Higher-dimensional asymptotically 
Euclidean manifold can also be defined. 


19 Asymptotically Euclidean mani- 
folds with several ends can also be 
considered, with just some additional 
complications in writing down the 
appropriate equations. 


20For metrics approaching the Eu- 
clidean metric in weighted Hölder 
spaces, see Choquet-Bruhat (1974) and 
Chaljub-Simon and Choquet-Bruhat 
(1979), and for those in weighted 
Sobolev spaces, see Cantor (1979), 
Cantor and Brill (1981), and Choquet- 
Bruhat and Christodoulou (1981). 


In the astrononphysical context, particularly relevant situations that 
can be confronted with observation are isolated systems of a few bodies 
far from any other source of gravitation, for example the Solar System 
and binary star systems. In these cases, it is legitimate to assume that 
(M, g), with g the Riemannian metric induced on M by the spacetime 
metric, tends to be flat far away from the studied system. It is then 
assumed to be an asymptotically Euclidean manifold, which we define 
below.!7 

A three-dimensional’® asymptotically Euclidean Riemannian 
manifold (M,4@) is defined as a smooth manifold M union of a com- 
pact set and ant? end Me"4 diffeomorphic to the exterior of a ball of 
R?. This end is given local coordinates xt, x”, 2° by this diffeomorphism. 
We denote by r,0,¢ the corresponding polar (pseudo) coordinates, 
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1 
r:= ae? | *. The Riemannian manifold (M, g) is called asymptot- 
ically Euclidean if in Me"? it holds that 

gij = 6g + O(r*), Ogiz = O(t»), 
with the notation 


ae oe g 


OP i= 
 (Oat)*: (Ax?) 2 (3x3) 


|k] = kı + ko + k3, 


and where a function f is said to be O(r7*) if r f is uniformly bounded 
on M®™®4, The useful maximum order of the derivatives appearing in the 
definition depends on the circumstances. Variants of hypotheses on the 
behaviour of metrics at infinity appear in the solution of problems on 
asymptotically Euclidean manifolds.?° 


IV.4.3 Physical comment 


Even in the vacuum case, there exist many Einsteinian spacetimes. They 
have underlying manifolds with different topologies, as well as different 
metrics on each of these manifolds. A great number of exact solutions 
of Einstein equations possessing isometry groups have been constructed, 
in vacuum or with various sources. Some of these spacetimes have (at 
least at present) a purely mathematical interest, but a few of them are 
models of known physical situations, at different time or space scales. 
There is no universal Einsteinian spacetime as a model for reality— 
this is in disagreement with Newton’s concepts, and also with Special 
Relativity. 

Before giving the main lines of construction and properties of gen- 
eral Einsteinian spacetimes, we justify their validity by their possible 
approximation by the Newton spacetime for sources with low velocity, 
and we give for weak gravitation the approximation of Einsteinian space- 
times by Minkowski spacetime. The Minkowskian approximation reveals 
a property of gravitational fields that has no analogue in Newton’s the- 
ory, namely the existence of gravitational waves and their propagation 


with the speed of light—this property eliminates the need for ‘action at 
a distance’ that was so puzzling for Newton. 


IV.5 Newtonian approximation 


We have said that Einstein’s theory of gravitation must nearly coincide 
with Newton’s in the physical situations where the latter proved to be 
accurate. These situations are slow (compared to the speed of light) 
velocity of the gravitating bodies with respect to Newton spacetime and 
weak gravitational fields. In Einstein’s theory, the spacetime coincides 
with Minkowski spacetime in the absence of gravitation, that is with 
the manifold R? x R endowed with the flat metric, which, in inertial 
coordinates and geometrical units, reads as follows: 


3 
—(dx®)? + Sa. 
i=1 
An Einsteinian spacetime with a weak gravitational field will be R? x R 
with a metric such that 
goo = —1 + hoo, goi = hoi, gij = ij + hij, 


with hag of order e << 1. 

In a spacetime with masses moving slowly (compared with 1, the speed 
of light), the components hia are small with respect to hoo and the time 
derivatives ðohag are small with respect to the space derivatives Ojhag; 
these are considered to be of the same order as hag. 


Exercise IV.5.1 Justify these statements by considering units where 
c=3 x 10°km/s. 


Hint: Change the timescale by seting dz? = cdt. 


IV.5.1 Determination of Gr 


The component Roo in the considered approximation is equivalent to 
5 poa 1 
Roo = ôi o0 = — gAhoo, 


with A the Laplace operator of Euclidean space. In Newtonian the- 
ory, the gravitational field is the gradient of the Newtonian potential, a 
function U that satisfies the Poisson equation: 


AU = —pG = —41Gnp (IV.5.1) 


where p is a positive scalar function on R? equal to the mass dens- 
ity of the sources and Gy is the Newtonian gravitational constant (see 
Chapter II). 

In Einstein’s theory, the energy sources are represented by the stress- 
energy tensor. Under usual circumstances, the most important energy 
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source is pure matter, i.e. the stress-energy tensor is of the following 
form, with u some positive scalar function and u the unit flow vector: 


Top = Ug. 
In the case where the matter has a small velocity (with respect to the 
speed of light, taken to be c = 1) for observers following the time lines 
of the coordinate system, it holds that ug is equivalent to —1 and u; to 
zero; therefore, Tia is negligible and 
1 _l 
Poo = G 2 (Too = 59007) = 3 CEH: (IV.5.2) 


We see that in the approximation that we are making, called the 
Newtonian approximation, the equation Roo = G gpoo gives 


Aho = — Gen, 


that is, the Poisson equation of Newton’s theory for the gravitational po- 
tential if hoo is identified with 2U and one sets Gg := 8nG y. This is the 
reason why many physicists write the Einstein equations in dimension 
3+ 1 as 


Sap = 8tGnTog- (IV.5.3) 


The value of the Newtonian gravitational constant depends on the 
choice of units for time, length, and mass. As we said before, Gy is 
approximately, in CGS units, 


G N cm3g-1s-2 = 6, 67259 x 107-8. 


Of course, making Gz = 1 instead of Gy = 1 implies a change by a 
factor 87 in the unit of mass-energy. 

In this book, we write the Einstein equations in geometric units, i.e. 
Sag = Tag, introducing the relevant scale factor to the usual physical 
units only when this is useful for interpretation of observations. 


IV.5.2 Equations of motion 
In Newton’s theory, the equation of motion of a test particle is 
drt OU 
d? Oa" 
In General Relativity, a test particle follows a geodesic of the spacetime 
metric, i.e. 
Ga” sa drò dz 
ds? ` "ds ds 


For small velocities, we have already seen that dz?/ds~ 1 and 
dzt/ds ~ 0; therefore, 


nl 
~ Tio ~ ziho: (IV.5.4) 


We see that the Einstein law coincides with the Newton law in the 
Newtonian approximation where hog = 2U. 


IV.5.3 Post-Newtonian approximation 


One can improve the Newtonian approximation by retaining further 
terms in the expansions. This leads to the so-called post-Newtonian 
(PN) approximation. To make these approximations, it is more conveni- 
ent not to replace the gravitational constant and speed of light by their 
geometric value 1. 

Let us briefly explain the structure of the first post-Newtonian (1PN) 
approximation, where one keeps all the corrections to the Newtonian 
approximation containing one power of 1/c?. There are three sources 
of such 1/c? corrections: (i) in algebraic (v/c)? terms, where v is a vel- 
ocity variable; (ii) in U/c? terms, where U is a Newtonian-potential-like 
variable; or (iii) in time-derivative terms 03 = c~? 0?. At the 1PN ap- 
proximation level, one must go beyond the linearized approximation to 
the Einstein equations, and keep some of the quadratically nonlinear 
terms in the time-time Einstein equation 


1 
R® = 8r Gye“ ee ere ro) . 
n— 


A convenient way of doing so is to parametrize the metric in the following 


exponential form:?! 


Joo = 
4 
Joi = — -3 Vi (IV.5.5) 
c 
2 
Jij = exp ta Tij- 


Inserting this form in Einstein’s equations, and using for simplicity wave 
coordinates, one finds (in dimension n = 3) that the auxiliary metric yij 
must be flat modulo corrections of the next PN order O(1/c*), i.e. 


1 
Vag = Dij + O (=) , 


In addition, one finds that the gravitational ‘scalar potential’ V and 
the gravitational ‘vector potential’ V; satisfy the following simple, linear 
equations: 


(IV.5.6) 


1 
AV — = 8V =-4rG 
aa aya (IV.5.7) 


AV, = -4r Gyo}, 
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21 Blanchet and Damour (1989). 
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22See, for instance, Will (2014). 


?3See Brumberg and Kopejkin (1989) 
and Damour, Soffel, and Xu (1991, 
1992). 


where the source terms ø and gf are simply given in terms of the 
contravariant components of the stress-energy tensor by 


T00 TSS 
os (IV.5.8) 
c 
, To 
g = . 
c 


The linear form of the equations (5.7) for V and V; allows one to solve 
them explicitly in terms of o and o;. At the 1PN approximation, one 
can write 


to’) 1 1 
V(t, £) = Gy f aa’ i a + za Of olta’) |x —2'| + O (=)| 


|z- z 
4 / 

Vi(t,z) =G fett +0 : 
la — z'| Ce 


(IV.5.9) 


The approximate form of the metric obtained by inserting 
Eqs. (IV.5.9) in to the expressions (5.5), the first of which explicitly 
reads (upon expanding the exponential to show the level of nonlinearity 
actually contained in the 1PN approximation) 


1 

go =-|1-3V+5V"4 o(=)| , (IV.5.10) 
is sufficiently accurate for describing all current experiments and obser- 
vations in the Solar System, including perihelion advances of planetary 
orbits, the bending and delay of electromagnetic signals exchanged 
between the Earth and planets or satellites, and the very accurate 
laser ranging data to the Moon. Let us, however, note several different 
extensions of the above 1PN formalism. 

First, to compare not only General Relativity but also different (non- 
Einsteinian) gravitational theories with experiments and observations, 
an extension of the above general relativistic 1PN metric, containing a 
whole collection of adjustable parameters, has been introduced, and is 
called the parametrized post-Newtonian (PPN) formalism.?? 

Second, when developing a general relativistic theory of the motion of 
an N-body system (such as the Solar System), it has been found useful 
to generalize the PN formalism into a multichart approach to general 
relativistic celestial mechanics.?? In contrast to the traditional one-chart 
approach to the general relativistic N-body problem (in which a single 
coordinate system is used to describe both the gravitational field and 
the motion of N bodies), the multichart approach uses N + 1 coordinate 
systems: a global coordinate system xz” and N local coordinate systems 
X4% (with A = 1,2,...,N), each of which is attached to one of the 
N bodies. This multichart approach has been found useful not only in 
the Solar System, but also in binary systems containing strongly self- 
gravitating objects such as neutron stars or black holes (see Section VI.6 
in Chapter VI). 


IV.6 


IV.6 Minkowskian approximation 


To compute the Minkowskian approximation of solutions of the Einstein 
equations, one considers Lorentzian metrics on R"+! with components 
in some coordinate frames: 


Jas = Nap + hag, 


with jag = diag(—1,1,....,1). One assumes hag and its derivatives to 
be small with respect to 1; their products are then even smaller. 


IV.6.1 Linearized equations at 7 


A first approximation is obtained by neglecting in the Einstein equations 
all terms containing products of the perturbation h, that is, by consid- 
ering a solution of the Einstein equations linearized at 7. The following 
linearizations are particular cases of those computed in Chapter I and 
result easily from the expressions for the Christoffel symbols and the 
Ricci tensor when products of h’s and of their derivatives are neglected: 


1 1 
(TÀo)n(h) = zlu, ap = a (Qahye — uha) 


il 
(5Raa)n(h) = —sOhap + 3a fg + Os fa (IV.6.1) 
where = nò}, with 0, := 0/0x*%, is the Minkowskian 


d’Alembertian operator and fa is the set of functions (indices are raised 
with the Minkowski metric n) 


1 1 
fot= (ha = Oh) Vi (IV.6.2) 
2 2 
The decomposition (6.1) is non-tensorial; the conditions f“ = 0 can 


be satisfied by choice of coordinates. 


Exercise IV.6.1 Show that the equations f° =0 are equivalent to the 
linearized wave equations for the coordinates x“, i.e. 


f> = (F^) (h), with F> — Fog = g Tig: 


Hint: 
(IPT) = n°2 OP a) la: 


IV.6.2 Plane gravitational waves 


The formula (6.1) shows that if f“ = 0 the first approximation is solution 
of the linearized Einstein equations at 7 in vacuum, (Rag) (h®) = 0 
which reduce to 


D aua gt: I o 2 | a 
nly = mro hN = | ap? 2 Gai begat.  (TV.6.3) 
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These equations are a system of ordinary wave equations for the per- 
turbation A“); the solutions propagate on Minkowski spacetime with 
velocity 1, i.e. the speed of light. 


In the case of a Minkowski background, the equations A = 0 imply 
trivially Of, = 0. However, Ofẹ = 0 implies fẹ = 0 only if fa satisfies 
appropriate initial data. We will study such problems in Section IV.7 
for the full Einstein equations. Here, for weak fields, we will proceed 
directly, considering the general plane waves 


nO) = capy(anx), with cag and ay constants. 


Such a perturbation of the Minkowski metric is a solution of the 
linearized Einstein equations at 7 if 


hG) = coslly = Cag [ca + Deo y"(ayx*) =0 (IV.6.3a) 


and (linearized gauge condition) 
2fa =a°Xaap (axr) =0, with xa. :=Cor— Imaxe. (IV.6.3b) 
Equation (6.3a) implies that aa is a null vector, 

(ao)? — X (ai)? = 0, (IV.6.4) 
while (6.3b) implies that the constant tensor Cag satisfies the polariza- 
tion conditions 

a*Xor = 0, 


which are n+1 linear and homogeneous equations with (n + 1)(n + 2)/2 
unknowns xy. The wave is said to be transverse-traceless, abbrevi- 
ated to TT, because by a first-order change of frame one can reduce the 
polarization conditions to the equations 


Coo = Coi = 0, ro = 0, and a'ci; = 0. 
Exercise IV.6.2 Prove this statement. 


The polarized cag span a vector space of dimension (n + 1)(n — 2)/2: 
in the classical dimension n = 3, weak gravitational waves have two 
degrees of polarization. For example, choose a frame such that ag = 
a3 = 0. The equations above then imply cy; = 0 and c22 + c33 = 0. 
Examples of oscillating weak gravitational waves are 

hul =0, except for Af; Aip, and AGY, 
with 


hee = cz sin(wt — wa"), ny = =A = cy sin(wt — wrt). 
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IV.6.3 Further results on gravitational waves 


We have seen that in situations where one can apply the linearized ap- 
proximation to Einstein’s equations, a system emits gravitational waves 
at infinity, which are given, to the lowest approximation, in terms of the 
quadrupole moment?‘ of the system. More precisely, in a TT coordinate 
system, the metric perturbation far from the system reads, to lowest 
order, 


2Gy 1 ll d r ji 

TT N 

hij (t,£) = a F (Pa Pa- 5 Pu Pu) T2 Iri (t =) | o(). 
(IV.6.5) 


where P;; = ðij—ni nj is a spatial projector orthogonal to the unit vector 
i — 


n? = x'/r and where the quadrupole moment of the system, I[;;(t), is 
given (to lowest approximation) by 


T00 4 s f a 
L(t) = jae ee) = r — 3 a) (IV.6.6) 


C 


The recent development of kilometre-size interferometric detectors 
of gravitational waves (LIGO, VIRGO, etc.) has provided strong mo- 
tivation for improving the theoretical treatment of the generation of 
gravitational waves by astrophysically realistic systems. As many of 
these systems (such as binary neutron stars or binary black holes) 
contain strongly self-gravitating systems, one needs to use the multi- 
chart formalism sketched in Section IV.5.3. In addition, it has been 
found necessary to (i) include higher multipoles than the quadrupole 
Iij, (ii) include higher post-Newtonian corrections to the Newtonian- 
level quadrupole formula, and (iii) push the post-Newtonian expansion 
to the highest possible approximation. These developments, however, are 
outside the scope of this book.?° 


IV.6.4 Tidal force 


The influence of a gravitational wave on the trajectory of a single isolated 
particle, as noted before, cannot be observed. The tidal force due to a 
gravitational wave acting on a pair of particles, i.e. the geodesic deviation 
equation (see Chapter I), depends on the Riemann tensor. Considering 
two nearby particles with initially parallel unit velocity u“ = ôf and 
spatial separation vector X’, the equation for the geodesic deviation 
force reads 


UUP Va Vg Xİ = X wu ROL, = XIR oz, (IV.6.7) 
with, at first order, 
u uf VaV oX’ = 02 X* 
and 


; _ Oo 
RO oj = spo. = satt, _ ar T (IV.6.8) 


24See Problem IV.11.5. 


?5For the generalization of (6.5) to the 
infinite sequence of higher multipoles 
see Damour and lyer (1991). For a 
review of the current theory of gravi- 
tational radiation from post-Newtonian 
sources, see Blanchet (2014). See also 
Damour and Nagar (2011). 
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26 Wentzel-Kramers—Brillouin. 


In the TT gauge, where no) = 


1 
doh, (IV.6.9) 
and hence 
1 
ORO = — 5 Abohis 


We see that the equation of geodesic deviation implies at first order of 
the Minkowskian approximation that 


Cre aan xin) Siji 


IV.6.5 Gravitational radiation 


The word ‘radiation’ usually refers to energy transfer without material 
support. 

In the preceding subsections, the approximations used were mathem- 
atically well defined and physically well understood. By analogy with 
other fields, in particular electromagnetism, one expects gravitational 
waves to carry energy. However, as we have already mentioned, there 
is no pointwise intrinsically defined gravitational energy. At best, it is 
possible to define some non-local quantities depending also in general on 
another a priori given metric—quantities that possess some properties 
analogous to those of the energies of other fields. Much important and 
very complex work, analytical as well as numerical, has been (and is 
continuing to be) devoted to the problem of gravitational radiation 
energy, but some fundamental questions are still open. 


IV.7 Strong high-frequency waves 


IV.7.1 Introduction 


It is of course possible to define weak gravitational waves as perturba- 
tions of a given non-flat Lorentzian metric and to obtain for them linear 
equations. It is more interesting to study nonlinear effects by extending 
to nonlinear equations the WKB?® anzatz, first used to study approxi- 
mate rapidly oscillating solutions of the Schrödinger equation in the 
form 


u(x) = a(x)e?@) , (IV.7.1) 


where x is a spacetime point and a and ¢ are functions on spacetime. 
The function ¢, called the phase of the wave, and w, a number called the 
frequency, are real. The number w is assumed to be large compared with 
the values of the functions a and @¢ and their derivatives with respect 
to x. Such an approximation is called in mechanics a high-frequency 
wave, or a progressive wave with wave fronts (x) = constant; the 
function u varies more rapidly in directions transverse to the wave fronts 
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than on the wave fronts. The study of progressive waves, also called the 
two-timing method, has many applications in classical as well as in 
quantum physics. 

The WKB ansatz, or its generalization by Lax to asymptotic series, is 
not well adapted for the study of high-frequency waves associated with 
nonlinear equations, because the product of two functions like (7.1) is 
not of the same type.?’ J. Leray?® and his collaborators introduced for 
linear equations a more general anzatz, replacing e’”?") by a general 
function of x and w(x). For quasilinear equations, it is then possible”? to 
construct high-frequency waves that are approximate solutions in a well- 
defined sense. The effects of the nonlinearity on the waves are possible 
distortion of signals and the appearance of singularities similar to shocks. 

In the case of General Relativity, where the unknown is a Lorentzian 
metric on a manifold V, a high-frequency gravitational wave in vac- 
uum is a Lorentzian metric that is the sum of a non-oscillating part g 
and a rapidly varying one depending on a large parameter w called the 
frequency: 


g(x, wo(x)) = g(x) T h, h := {uT u(x, €) + w w(x, E) Jezwele) 
(IV.7.2) 


We say that (7.2) is an asymptotic solution of order p of the vacuum Ein- 
stein equations if it is such that w?” Ricci(g(x,wo(x)) remains uniformly 
bounded”? as w tends to infinity. 

To write the asymptotic expansion of the Ricci tensor of the metric 
(7.2), we set?! 


hence Oh=O(g—g), Oh=6(g—g). (IV.7.3) 


h := ôg := 9-49, 


The Taylor formula gives the expansion 


1 
Ricci(g) — Ricci(g) = Ricci + 59 Ricci +., 


(IV.7.4) 


with 


Ricci = Ricci} -h + Riccig, - Əh + Ricciga, - 07h. 


: eri er PR ani 
The coefficients Ricci}, Ricci}, Riccty,, and Ricciys ó depend only on 


the background metric g. The second derivative 6?Ricci, a quadratic 
form in (h, ðh, 07h), is computed analogously. We remark that 6? Ricci 
does not contain the square of 07h, because Ricci(g) is linear in 07g. 
To compute ðh and 07h, we use the definition of h. We denote by a 
prime a derivative with respect to € and underline partial derivatives 
with respect to x. We set y) := ap = y. Elementary calculus implies 


Inhaa(x, wH(x)) = {avya(z,€) + w*[A.vaa(x,€) + drwog(a, €)] 
tw", wWeaa(a, £) keswa 


27Isaacson applied the original WKB 
method to the linearized Einstein equa- 
tions and then looked for a solution of 
equations with source a stress—energy 
tensor obtained by averaging the ob- 
tained perturbation. Progressive waves 
of the type (7.2) for the nonlinear 
Einstein equations are constructed in 
Choquet-Bruhat (1969a). 


°8Garding, Kotake, and Leray (1966). 


29Choquet-Bruhat (1969b). For coup- 
ling of high frequency gravitational and 
fluid waves see Choquet-Bruhat and 
Greco (1983). 


30The added term in w~? is introduced 
to ensure rigorously the asymptotic 
character of the constructed solution. 


311t is also possible to write a 
coordinate-independent formula by 
introducing a given metric e. Note 
that Ricci’ 52 and Ricciggag are 
independent of the choice of the given 
metric e. 
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32That is, if its gradient is a null vector. 
This condition is necessary for the wave 
to be significant. 


and a corresponding formula for the second derivative, 
DIwhap = WPAP LVB + PAV + Pua + ARTA 
+ PAPuWagB +w OTe Sci 
with 
Rainer = 4 [Hues + bawan + (Yu, + 9r.9,,) We 
tw '0)9,, Wal (2, £) ies os 


IV.7.2 Phase and polarization 


The asymptotic expansion in w of the Ricci tensor is obtained by re- 
placing the metric and its first and second derivatives by the above 
expressions. We see that this expansion starts with a term in wt, whose 
coefficient must vanish for g to be an asymptotic solution of order zero. 
We thus obtain the equations 


Ricctg gPreuv" = 0. (IV.7.5) 


This is a linear homogeneous system for the second derivative with 
respect to the parameter £ of the tensor v, which reads, in coordinates, 
1 1 
— JO Pras + 7 [Pa Palv) + pp Pa(v’)| = 0, (IV.7.6) 
with 


1 
P,(v') = yu, — PoV A vÀ, 


By analogy with what we saw in the Minkowskian approximation, we 
call P, the polarization operator. We state as a theorem the result 
we have obtained. 


Theorem IV.7.1 If the phase is isotropic,®* then the necessary and 
sufficient condition for the progressive wave to satisfy the Einstein equa- 
tions at order zero in w is that the tensor v satisfies the four polarization 
conditions 


P,(v) = 0. 


Exercise IV.7.1 Prove that the polarization conditions express the 
vanishing at order zero of the perturbation of the harmonicity functions 
g TS 

H 


Hint: For an arbitrary metric, the following identity can be proved by 
straightforward computation (see Chapter VIII on the Cauchy problem): 


Rag = RË? 3) + Lap; (IV.7.7) 
with 


Lop = = (garða F> + 98,00F>) , (IV.7.8) 


NI rR 


IV.7 


F%= Cae) ae = GV yður” = z“, 


while the a are a system of quasilinear, quasidiagonal (i.e. linear and 
diagonal in the principal, second-order, terms) wave operators, 


1 
RER = 59" By Gas + Paa(g)(O9,09), (IV.7.9) 


where P is a quadratic form in the components of Og, with coefficients 
polynomial in the components of g and its contravariant associate. 


IV.7.3 Propagation and backreaction 


Using previous results, or by direct computation, we find that the 
coefficients at zeroth order in w in the expansion of Ricci(g) are 


AD = Rog + (Pv')ag + Liv’, wag + Nagl, v, v"), 
with P a linear propagation operator along the rays of Oy, namely 


1 
(Pv')ag =- GA + T ; 


while L reads 


Lap = 5 [eaQalv’) + Qal) + 5 lea Palu") + pgPalw")], 
with 
Qalv!) = Byv'd — Favà. 
The nonlinear term N(v,v',v”) comes from ô°Rag. We find that for 
polarized v it reduces to 


1 1 1 1 
Nog(v,v',v") = =Pave ay — ee + 5 Gace — TA ; 
We can now prove the following theorem. 
Theorem IV.7.2 The progressive wave 
Iag (T) + {wT tvag(x, E) + w7’ waplt, E) feawy(e) 


is an asymptotic solution of order one of the vacuum Einstein equations 
on a manifold V under the following hypotheses: 


(1) The phase ọ is isotropic for the background g. 


(2) v satisfies the linear, non-homogeneous, propagation system 


P(v') =0 


Strong high-frequency waves 


along the rays? of the phase y and also satisfies the polarization ?3Null curves tangent to Ov. 


conditions on a hypersurface % transverse to these rays, assumed to 
span V. The field v is periodic in € on X. 
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(3) w is a solution of the linear system 


1 1 
P,(w") = Qalu) + qeelv ny = vivy + zeal — E). 


(4) The background metric g satisfies the Einstein equations with source 
a null fluid: 


Rag = Epapg, 


where, with T the period of v, 


1 l 
E= Lf n E(.,6)dé, with E= 3 ( Mea — T : 
Proof. 


(1) If v’ satisfies the propagation equations P(v’) = 0 on V x R and the 
polarization conditions on © transverse to rays that span V, then it 
satisfies the polarization conditions on V x R because the equation 
P(v') = 0 implies the propagation both of p*vgg = 0 and vå = 0. 
Indeed, 


°° (Pu! op = —pOyuy + og `a pÀ 
and also 
p% (Puas = -03 (pva) — Lg vapO.9 
because if Ya is a gradient and isotropic, then 


0 O\Ga = PaPa = O. 


The coefficient of w in the asymptotic expansion of Ricci(g) is 
therefore zero. 


(2) The function x — v'(x, €), a solution of the linear differential equa- 
tion P(v’) = 0 with coefficients independent of £, is determined on 
V x R if it is known on a submanifold transverse to rays that span 
V. It has period T in € if its data on the submanifold have period T. 


(3) When v is known, the equations for w” are non-homogeneous linear 
equations, namely 


Log + Nag + Rag = 0. 


(4) If v and w’ have period T in €, then the following relation holds, 
because linear terms in v’ or w” integrate to zero: 


T 
TRag = -f Nag(-,€) dé. 


An elementary computation gives 


Nog =—Palp 


from which there follows the expression given for R and hence the 
linear system for w” given in the theorem. These linear equations 
have solutions on V x R, periodic with period T in €, because the 
linear transposed homogeneous system has an empty kernel and the 
coefficients have period T. The tensor w can be chosen also of period 
T in € and the right-hand side has a zero integral on € on the interval 
Veta. 


Note a most remarkable fact, which is not shared by other nonlinear 
fields: the vector v obeys linear propagation equations; that is, there is 
no distortion under propagation of gravitational signals, in spite of the 
nonlinearity of the Einstein equations. 


IV.7.4 Observable displacements 


As in the case of weak gravitational waves, the observable displace- 
ments due to a strong gravitational wave are governed by the tidal force 
determined by the highest-order terms in w of the Riemann tensor. 


Exercise IV.7.2 Consider two nearby particles with initially parallel 
unit velocity u“ = dg and spatial separation vector X'. Write the first 
approximation of the geodesic deviation due to a strong high-frequency 
wave. 


IV.8 Stationary spacetimes 


IV.8.1 Definition 


We call stationary an (n+ 1)-dimensional spacetime (V, g) that admits 
a one-parameter group G1 = R of isometries with timelike Killing vector 
E whose orbits are diffeomorphic** to R and span the manifold V. More 
precisely,?° we assume that V is a product M x R, a point of V is a pair 
(x,t), x E€ M, t € R, the subspaces M x {t} of constant t are spacelike 
submanifolds, and the Killing vector € is represented by 0/0t. In frames 
adapted to the product structure, the spacetime metric g reads 


g = Y?’ (dt + a)? +9, (IV.8.1) 


with ¢ € R a time coordinate on the orbits of the vector € = 0/0t; 
w, a, and g are respectively a t-dependent scalar, a 1-form, and the 
Riemannian metric on M. In local coordinates z in the domain of a 
chart of M, one has 


a= aidz, g=gijdz'dz’, (IV.8.2) 


with all coefficients in g being independent of t. 
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34We exclude the case of orbits diffeo- 
morphic to St, i.e. the case of closed 
timelike curves, because this is con- 
sidered non-physical. 


35The assumptions that V is a product 
M x R could be relaxed in the defin- 
ition of stationarity, but the uniqueness 
theorems given below would not neces- 
sarily hold. 
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36See, for instance, CB-DMI IV. 


A stationary spacetime is called static if the orbits are orthogonal to 
the n-dimensional space manifolds. Its metric then reads 


g = —N?dt? +g. (IV.8.3) 


Static spacetimes are considered to represent equilibrium situations, 
while stationary spacetimes model permanent motions. Both play an 
important role in relativistic dynamics. 


Lemma IV.8.1 A stationary spacetime is static if the 1-form a is an 
exact differential: 


a= do, 1.€. Qi = iO, 
with @ a t-dependent scalar function on M. 


Proof. For a stationary spacetime, assume that a = dọ, and consider 
the change of its time and space factorization under the change of time 
parameter 


t=t+ġ. (IV.8.4) 

This puts its metric into the static form 
g = —N? dt? + gijdz'dat, (IV.8.5) 
with g and N = w independent of t. Oo 


Static spacetimes are invariant under time reversal, t — —t. This 
property can also be taken as a definition to distinguish static spacetimes 
among stationary ones. 

One calls locally static those stationary spacetimes for which the 
1-form a is closed, da = 0, but is not globally an exact form, i.e. there 
does not exist on the whole of M a function such that a = dọ. This can 
happen only*° if M is not diffeomorphic to R”. 


IV.8.2 Equations 


We denote by V and Ricci the covariant derivative and Ricci tensor in 
the Riemannian metric g. One can show by straightforward computation 
that the components of the Ricci tensor of the spacetime metric 

g := —W (dt + aidz’)? +9, with g := gijdx'dz!, 
reduce, with f = da, to 


Rij = Rij + 2 Fi” fin w'Vid0;v, with f? := gI (V kai — Viak), 
(IV.8.6) 


ms -a E, (IV.8.7) 


R= Tytti + WAG. (IV.8.8) 


Exercise IV.8.1 Prove these formulas using the general formulas for 
connection and curvature given in Chapter I. More general formulas are 
derived in Chapter IX. 


In the case of a locally static spacetime, the components of the Ricci 
tensor reduce to 


Ri; = Rij — V7 Vidi%, (IV.8.9) 
Rio = 0, (IV.8.10) 
Roo = wAgw. (IV.8.11) 


The second identity?” shows that the source must have zero momentum 
for an Einsteinian spacetime to be locally static, justifying its name. 
A reciprocal theorem is easy to prove under a physically meaningful 
hypothesis, namely the following: 


Theorem IV.8.1 A stationary Einsteinian spacetime with sources of 
zero momentum, i.e. poi = 0, is locally static if either 


(1) M is compact. 
(2) The spacetime is asymptotically Euclidean. 


Proof. If po; = 0, a previous identity gives 


oe 
Rio = ~ gy Vili H*) =0, 

and hence 
= nie a 1 oo 
Vila fiiy”) = aU Fi fi. (IV.8.12) 


If M is compact (and hence without boundary according to our 
definitions) integration of the above equality on M gives, by Stokes’s 
formula, 


Y? fi fitum =0, 
M 


and f;’ = 0 since fj) f;’ > 0, 
If M is asymptotically Euclidean, then we have, with B, a ball of 
radius r, and using again Stokes’s formula, 


oe i ae nar 
f Pfi fitum = 5 lim i fe fi up, = lim g a’ fb? nj Hop, 
M roo J p, r> Jap. 
(IV.8.13) 


When the radius r of the ball tends to infinity, the right-hand side tends 
to zero if n > 2, because if the spacetime is asymptotically Euclid- 
ean, then the product af falls off like r~@?+)), 2p +1 > n —1, while 
Has, =r” tusz, where g2 is the volume element of the sphere S"~?. 
We conclude again that f? = 0 on M. E 
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37For a consequence of the last iden- 
tity when M is compact, see Prob- 
lem IV.11.9. 
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38Lichnerowicz (1939). 


39By a theorem of Schoen and Yau, 
a locally flat asymptotically Euclidean 
(complete—a property included in our 
definition) Riemannian 3-manifold is 
isometric to R. 


40See Chapter 3 of Wolf (2011). 


41Fourés (Choquet)-Bruhat (1948). 


IV.8.3 Non-existence of gravitational solitons 


One calls a gravitational soliton a complete non-trivial (i.e. non-flat) 
stationary solution of the vacuum Einstein equations. 

We extend to arbitrary dimensions the proof of a theorem given 
in 3+1 dimensions by Einstein and Pauli in the static case and by 
Lichnerowicz®® in the stationary case. 


Theorem IV.8.2 Jn (n+1)-dimensional spacetimes, the gravitational 
solitons (M x R,g) with M compact or asymptotically Euclidean are 
locally static with Ricci-flat space metric and p a constant. 


Proof. Since gravitational solitons are solutions of the vacuum Einstein 
equations, Theorem IV.8.1 shows that they are locally static: f? = 0. 
The remaining vacuum Einstein equations read 


Az% = 0, (IV.8.14) 


Rij = Y Við;y. (IV.8.15) 


By well-known theorems resulting from the maximum principle applied 
to solutions of elliptic partial differential equations, the equality (8.14) 
implies that ~ = constant on a manifold M that is compact or asymptot- 
ically Euclidean (if ọ is smooth and uniformly bounded—assumptions 
we implicitly include in the definition of gravitational solitons). The last 
equation concludes the proof. E 


Corollary IV.8.1 Under the hypotheses of the theorem, the only 
(3+1)-dimensional gravitational solitons are locally flat. 


Proof. In 3 dimensions, a Ricci-flat manifold is flat. E 


In the classical case considered by Einstein where M is an asymptot- 
ically Euclidean? manifold diffeomorphic to RÌ, the only gravitational 
soliton is Minkowski spacetime M4. 

In the compact case, there are, besides the flat torus T3, five non- 
isometric (i.e. non-diffeomorphic) Riemannian orientable compact flat 
manifolds, quotients of the Euclidean space FÆ? by discrete isometry 
groups.7? 


Remark IV.8.1 Corollary IV.8.1 is not true in higher dimensions. 


IV.8.4 Gauss’s law 


A property of stationary spacetimes deduced easily from the previous 
identities is the following, which we have called*! the relativistic Gauss 
law. 


Theorem IV.8.3 The time-time component Rt of the Ricci tensor of 
a stationary spacetime (M x R, ĝ) in a natural frame with time azis tan- 
gent to the timelines and space axis tangent to the manifold M; satisfies 
the divergence identity 


yR! = yR = V’ | dip + L(a fyb?) : (IV.8.16) 


Proof. The component Rt is immediately deduced from the components 
previously computed in the frame 6° := dt + a;dxt. The change-of-frame 
formula gives 


Ri = Ro + a,Ri, (IV.8.17) 
with 
Ro=-v "Roo, R= ĝ” Roj. (IV.8.18) 


Therefore, 
R=- Fee +0 Ag+ 50 tov (05) ,  (TV.8.19) 


which gives, using the definition of fij, 


Ri = y! [Ase 4 FIE] , (IV.8.20) 


that is, the given identity. Oo 


Corollary IV.8.2 If the space manifold of a stationary Einsteinian 
spacetime with source p is asymptotically Euclidean, then it holds that 


fluxingnity grad Y := im f n'ðipuaB, = Yi uz. 
P> JAB, M 


Exercise IV.8.2 Prove this corollary, which has an analogue in New- 
tonian mechanics, called Gauss’s law. 


IV.9 Lagrangians 


Lagrangians, arising from energies, play a fundamental role in physics. 
The Lagragian formulation of the Einstein equations, found independ- 
ently by Einstein and Hilbert, stands apart, being unrelated to a 
pointwise intrinsically defined gravitational energy. However, it plays an 
important role in many modern developments, in particular for equations 
with field sources. 
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42One equivalently says that a solution 
of the Einstein equations in vacuo is 
a critical point of the Einstein—Hilbert 
Lagrangian. 


IV.9.1  Einstein—Hilbert Lagrangian in vacuo 


In this section we use the results of Chapter I, and its physicists’ 
notation. 


Theorem IV.9.1 The Einstein equations in vacuo are the Euler 
equations’? of the Einstein-Hilbert Lagrangian, defined by 


bgal = f Ra), (v9.1) 
with R(g) the scalar curvature and ug the volume element of the 
Lorentzian metric g. 


Proof. We have 


Lew) = | [R(G) tg + RO): (IV.9.2) 
with 
OR = gP Rag + Ragg” 
and 
1 ab 
Ôlg = ~ 590899 Hg- 
We have found in Chapter I that g°?d Rag is the divergence of a vector, 
9°? bRag =V, vr := Whe —Vah**, hag:= gap, (IV.9.3) 


and hence the integral of this term vanishes if vò vanishes on the bound- 
ary of the integration domain, as is assumed in computing the Euler 
equations. We therefore have 


bLgrav(g) = / [SR(g) Mg + R(g) dp] = J (Ras = T 59°" u(g). 


(IV.9.4) 
E 


Exercise IV.9.1 Prove the formula (IV.9.4). 


Hint: The derivative of a determinant is 0) det g = gð Jag det g. 


IV.9.2 Lagrangians for Einstein equations with sources 


A Lagrangian for the Einstein equations with sources (with the latter 
being generically denoted by ¢) is the sum of the vacuum Lagrangian 
and a Lagrangian for sources. By the equivalence principle, the Lagran- 
gian in General Relativity of a source, L(g,¢) = f L(g, ¢)ug, should 
be deduced from the Lagrangian in Special Relativity, invariant under 
diffeomorphism, by replacing the Minkowski metric 7 by g. 


General theorem 


The vanishing of the variation with respect to g and ¢ of the sum of the 
Einstein—Hilbert Lagrangian and the Lagrangian of the sources gives the 
Einstein equations with source a tensor T, together with the equations 
of motion for the sources. 

The first variation of the total Lagrangian 


ited I [R(g) + L(g, lpg (IV.9.5) 


reads (the dot below denotes some algebraic linear form) 


5Leot(9,$) = | {3 (g) — M°9(g, ¢)]5gap — BC, 4) - 6b}4t, (IV.9.6) 


where = 0 are the equations of motion of the fields ¢ on (V,g) and 
M°® coincides with their stress-energy tensor. 


Remark IV.9.1 The conservation laws VaM®P = 0, necessary for 
the solvability of the Einstein equations with source @, are a consequence 
of the invariance under diffeomorphisms of the source Lagrangian, which 
implies that if P(g, ġ) = 0, then it holds that for any vector X, 


[MPAs +V6Xa)lg = 0. 


Matter and field sources 


In the case of matter sources, it is somewhat difficult to find a La- 
grangian in Special Relativity, and even in Newtonian mechanics. A 
simple-looking Lagrangian for barotropic perfect fluids was proposed by 
Taub.*? It reads*4 


Laid = J p(u)Hg- 


The Taub Lagrangian leads to the Euler equations through a some- 
what involved process, introducing first a Lagrangian with the constraint 
g(u,u) = —1 and a Lagrange multiplier A, and using Lagrangian-type 
coordinates where the timelines are the flow lines. 

Another proposal introducing velocity potentials is due to Schutz.*° 

None of these Lagrangians has been effectively used in the study of 
properties of relativistic fluids. 

In the case of the usual field sources, the Lagrangians in Spe- 
cial Relativity are well known and naturally transferred to General 
Relativity. 

The Maxwell equations in vacuo are the Euler equations with respect 
to a closed 2-form F of the Lagrangian 


1 : 
Lam E g) := -4 | E° Footy (IV.9.7) 


The Lagrangian of a scalar field f with potential V(f) is 


Laca g) = -f [308ta + ven] Hg- (IV.9.8) 
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43Taub (1954, ?). 


44Here u denotes the energy density 
and ug the volume element. 


45 Schutz (1971). 
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46See YCB-OUP 2009, Chapter III, 
Section 6.5. 


The Lagrangians for Yang-Mills fields and wave maps are obtained 
analogously by replacing ordinary products by scalar products in the 
Lie algebra in the case of Yang-Mills fields and in the metric of the 
target in the case of wave maps.*6 

In particle physics at a classical (i.e. non-quantum) level the dynamics 
is ruled by the standard model Lagrangian with a structure of the 
type 


1 - 1 
Lstandard(@) == / Kaez + wy Duy + 39” DH ‘ DH 


+V (H) + AYHy| ug, 


with ¢ := (F, 4, H, g), F a Yang-Mills field with values in the Lie algebra 
of U(1) x SU (2) x SU (3), w various spinor fields, and 7“ Dirac gamma 
matrices, while H is a complex-valued scalar doublet, the Higgs field; V 
is a potential for H and A is some constant coefficient. 


Exercise IV.9.2 Check the general theorem in the Maxwell and scalar 
field cases. 


Hint: 
5 I gP AaO8b Mg = J 59 AaPOabHg +2 / 9°" a5b03b hg 


+ f 92.8505 


1 F 
= f 60°? (246026 ~ S4aa4*"040,6) -2 f 560° Tabang: 


The theorem is taken to define the equations for sources and the 
stress-energy tensor when only the Lagrangian is known. 


Remark IV.9.2 A constant factor depending on the units in which 
the sources are measured must be put in front of the Lagrangian for the 
sources when it is added to the vacuum Einstein Lagrangian in the case 
of interpretation of observations in non-geometric units. 


IV.10 Observations and experiments 


In Chapter V, we will study the spherically symmetric vacuum Eisteinian 
spacetime, the first exact solution constructed by Schwarzschild in 1916. 
We will explain most of its observable consequences, and briefly quote 
the results of observations, which are all in very good agreement with 
the predictions. 

We will discuss the reality of the revolutionary prediction of Einstein’s 
equations—black holes—in Chapter VI. 

The recently operational gravitational wave detectors VIRGO and 
LIGO have not so far been able to detect any gravitational wave signal. 


This is not so surprising, given the predicted weakness of such signals. 
However, the reality of energy loss by gravitational radiation was 
confirmed as early as 1979 by the observation by Taylor and collab- 
orators of the slowing down of the period of a binary pulsar,4’ which 
was interpreted as a shrinkage of its orbit due to gravitational radi- 
ation. Many other binary pulsars have now been discovered. More than 
30 years of observations and recording of data from the original Hulse— 
Taylor pulsar have led (not without hard work from theoreticians and 
physicists) to information being obtained on several physical properties, 
including the masses, orbits, rotation, advance of the periastron, and 
other parameters, of the pulsar and its invisible companion (both are 
now thought to be neutron stars). All the results obtained agree with 
those predicted by Damour and Deruelle*® from the Einstein equations, 
rather than those predicted by alternative gravitation theories.’ 

General Relativity and the Einstein equations are universally adopted 
as the best model for gravitation at planetary and astronomical scales. 
Their use at the atomic, quantum, scale requires new tools, still 
the subject of much research and discussion. The sub-Planckian and 
cosmological scales pose serious problems that probably require new 
ideas. 

If gravity is to become a quantum field theory like others, there must 
exist an elementary particle called the graviton. This particle should 
be massless like the photon, since gravitation propagates with the speed 
of light. The putative graviton®? is thought to be spin®! 2 for the same 
reasons why the photon is called spin 1 and the electron and the neutrino 
spin 5. 


IV.11 Problems 


IV.11.1 The Einstein cylinder 


The Einstein cylinder”? is the manifold S? x R endowed with the 
static metric 


— dt? + afya, (IV.11.1) 


with aj a constant and y, the metric of the unit 3-sphere 9°, which is, 
in the usual angular coordinates, 


y4 := d8? + sin? 6 (dd? + sin? ọ dy). 
1. Show that setting sin 0 = r gives the form familiar to geometers: 


dr? 


=~] 


+ r?(d¢? + sin? ody). 


y4 (IV.11.2) 
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4TA binary pulsar is a system of two 
stars gravitating near enough to each 
other to be considered as an isolated 
system, one of them being a pulsar. A 
pulsar is a star from which we receive 
strong electromagnetic signals at regu- 
lar intervals. It is now thought that the 
signals are emitted in the direction of 
the axis of a rapidly rotating neutron 
star. 


48Damour and Deruelle (1981, 1986) 
and Damour (1982). 


49For observational and experimental 
results up to the end of 2013, see the re- 
view by Damour for the Particle Data 
Group (Damour, 2013b). 


50The observation of a graviton is very 
unlikely in the near future. 


51 Spin is a quantum notion whose pre- 
cise definition is outside of the scope 
of this book. It takes only integer or 
half-integer values. A supersymmetric 
partner of the graviton, the gravitino, 
would have spin 3. 


52The Einstein cylinder, also called the 
Einstein static universe, was the first 
Einsteinian cosmological model to be 
constructed. 
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Show that the components of the Ricci tensor are (whatever coord- 
inates are used) 


2. Show that the Einstein cylinder is a solution of the Einstein equations 
with perfect fluid source of constant positive energy and negative 
pressure: 


Lo = 3a", po = -a° <0. (IV.11.4) 


3. A negative pressure is unacceptable on classical physical grounds. 
Show that the Einstein cylinder is a solution of the Einstein equations 
with a positive cosmological constant and source a perfect fluid with 
positive energy and pressure. 


4. Show that the Einstein cylinder is locally conformally flat. Determine 
the image of Minkowski spacetime under a diffeomorphism into the 
Einstein cylinder. 


Solution 


1. This is a straightforward computation. 
2. The Einstein equations with cosmological constant A are 


Sap = Tap — Agap. 
We deduce from (11.3) and (11.4) that 
R= bag and Soo = Baa; Si = -yi 


The stress-energy tensor Tag = (u + p)uaug +Ppgag of a perfect fluid 
in the Einstein static universe with static flow vector u? = 1, u? = 0 is 


Too = Ho, Tor =0, Tij = pooyi}, 
and hence (recall that on the Einstein cylinder, goo = —1, gij = ayi ) 
Soo = Too + A, Sij = Tij — aN 
if 
2 


Ho = 30 — A, po =A- a^. 


IV.11.2 de Sitter spacetime 


The de Sitter spacetime has for supporting manifold S? x R, like the 
Einstein cylinder. Its metric, analogous to the Einstein cylinder metric, is 


— dt? +a, (IV.11.5) 


but the coefficient a is now time-dependent. 
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1. Show that the de Sitter metric is a solution of the vacuum Einstein 
equations with positive cosmological constant A if and only if the 
coefficient a is such that 


a R A 
ä—k?a=0, with a:= a ä:= a k= g (IV.11.6) 
and 
a — ka? = 1. (IV.11.7) 


2. Show that the general solution for (11.5) is, with A and B a pair of 
constants, 


a = Ae* + Be™. (IV.11.8) 


Show that such a function a satisfies (11.6) if 4AB = k~?. Show that 


the metric, called the de Sitter metric,” 53The de Sitter spacetime is time- 


symmetric. The radius of its spherical 


— dt? +4 k-?(cosh? kt) [da? + sin? odo? 4 sin? 6 d¢?)| (IV.11.9) space sections expands to infinity in 


both time directions. 


is a solution of the vacuum Einstein equations with positive cosmo- 
logical constant. 


IV.11.3 Anti-de Sitter spacetime 


1. Show that the spacetime metric defined on R? x (—7/2, 7/2) by 
— dt? + (cos? t) 7_, (IV.11.10) 


where y_ is the metric of the hyperbolic 3-space of constant negative 
curvature, 


y- := dx? + (sinh? y)(sin? 6 d? + dé”), 


satisfies the vacuum Einstein equations with negative cosmological 
constant A = —3. 


2. The spacetime classically called anti-de Sitter spacetime is the 
manifold R? x R with the static metric 


gaas := —dt? (cosh? y) + dx? + sinh? y(sin? 6 dø? + dé”), 0< x < oo. 
(IV.11.11) 


Show that it is conformal®* to the half 0 < 0 < m/2 of the Einstein 54This property has been used by 
cylinder. Choquet-Bruhat to prove the global ex- 
istence of Yang—Mills fields in anti-de 

3. Show that no light ray emitted at time to will go beyond a space slice Sitter spacetime. It plays an important 


at finite time t = tọ + T. role in recent work by H. Friedrich. 


Remark IV.11.1 Five-dimensional anti-de Sitter spacetime plays 
an important role in supersymmetric theories conformal field, in 
particular with regard to the so-called AdS—CFT correspondence. 
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Solution 


1. This is straightforward computation. 
2. Set 


sin a 


1 
cosh y = aan hence sinh y dy = me 


Computation using the identities satisfied by sines and cosines gives 
the announced identity: 


gaas = ——[-dt? + da? + sin? a(sin? 6 do? + d6?)]. 
COS* Q 


. We have on a radial light ray 


x 
es , hence t= ¢ +f dx ; (IV.11.12) 
cosh x xo cosh x 
and hence, whatever xq is, 
X d wd 
t-to< f as <2 / PS as: (IV.11.13) 
o cosh x o eX 


Therefore, no light ray emitted at time tọ will go beyond the space 
slice t = to + T. The anti-de Sitter spacetime has no global causality 
properties; it is not globally hyperbolic (see Chapter VIII). 


IV.11.4 Taub-—NUT spacetime 


The underlying manifold of the Taub spacetime is S x I with 9? a 
three-dimensional topological sphere and J := (t_,t) an interval of R. 
This manifold is endowed with the Lorentzian metric 


—U~"dt?+(20)?U (dy+cos 0 dd)? +(t? +") (d6?+sin? Odd”), (IV.11.14) 


where 
te =m+(m +2)? (IV.11.15) 
and 
O _ 2(mt + L) 
U=-l+—aop (IV.11.16) 


The numbers £ and m are positive constants. The coordinates Y, 0, ¢ are 
the Euler coordinates on the sphere $°. 


I; 


Show that the Taub spacetime is a solution of the vacuum Einstein 
equations. 


. Show that U tends to zero as t tends to t} or t_. Show that the 


metric is still Lorentzian when t is in the complement in R of the 
closure of J, but w is now a time variable and t a space variable. 


. Show that by setting 


1 f' dr 
= — —— IV.11.17 
abt z f UGY’ ( ) 
the Taub metric extends to a C% metric on S? x R, called Taub- 
NUT:55 


Al? dy? — 40 dọ dt + 40°U cos 0 do dy — 2l cos 0 dt do (IV.11.18) 
+(t? + L) (d0? + sin? 0 d¢?). E 
. Show that the two hypersurfaces, diffeomorphic to S3, t = m+ (m? + 
(2)2, which bound the Taub spacetime in Taub—NUT are null mani- 
folds, generated by closed null geodesics where w only varies. They 
are the Cauchy horizons of the maximal development of any Cauchy 
surface (see Chapter VIII) in the Taub spacetime. 


. Show that one family of null geodesics issuing from a point of the 
Taub region crosses both horizons, and that another family spirals 
near these surfaces and is incomplete. Show that in the NUT region 
there are closed timelike curves (the surfaces t = const are timelike 
there). 


. Construct another extension of the Taub spacetime with analogous 
properties, though not isometric to it, by the change of coordinates 


a 1 ft dr 
Y -4-3 T (IV.11.19) 


Remark IV.11.2 The Taub-NUT spaces give counterexamples to sev- 
eral conjectures (see Chapter VIII). The consolation is that they are not 
generic, because of their symmetries. 


IV.11.5 The quadrupole formula 


Many approximation methods in General Relativity use approximations 
of a Newton-type potential U generated by a mass of density p given in 
geometrical units (x = 1) by the integral 


U(x) = PUY) dy, dy = dy! dy? dy’. 
re |£ — yl 


1. Show that 1/|x — y| admits the following Taylor expansion about 


y = 0: 


qizi 


1 1 i£ 1 joa ace 
eal A > Yat 5 5 (3y'y -r J Ta T 
i=1,2, 


with r? := sigs, and r’? := Peer cee 
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5 ; ; : 
55For its discoverers, Newman, Unti, 
and Tamburino. 
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2. Write, for a point x outside the support of the function p, 


M ae A priri 
U(a) = — ie DD SHa. 


ae r 
uJ 


Give the expressions for M, Dt, and Q”, which are called respectively 
the mass, the dipole moment, and the quadrupole moment of the 
matter density p. Show that U reduces to —M/r if p is spherically 
symmetric. Show that D? can be made zero by choosing for origin 
the Newtonian centre of gravity of the matter. 


3. Show that for a non-spherical mass, the gradient of U will in general 
differ from —M/r? by a term in 1/r+. 


IV.11.6 Gravitational waves 


Give details and draw images of the displacement of the plane-wave 
perturbation of a Minkowski metric. 


IV.11.7 Landau—Lifshitz pseudotensor 


By analogy with other fields, it was thought long ago that a locally 
defined energy for the gravitational field should be defined in a post- 
Minkowskian approximation by a quadratic form in the first derivatives 
ahap, hag := 6Gab=Jos — Nag. A simple-minded way to obtain such a 
quadratic form is inspired on the one hand by the Einstein equations with 
right-hand side the stress-energy tensor of matter or other fields and 
on the other hand by the classical Picard iteration method for solving 
quasilinear equations. 


1. Show that in vacuum the first approximation h := dg at 7 of a metric- 
g solution of the Einstein equations satisfies the linear equations 


1 1 
Rag = Lap(h) = — 50 Orhaat5 [33h + Oy Agha = Ie Agh3 | = 0. 
(IV.11.20) 


2. Show that the Picard method gives for the approximation h) of 
order two the following linear equations with source quadratic in the 
first approximation h: 


Laph”) = h™ [8,(Oahpp + shau — Ophap) — OaAghay] + tap, 


with tag quadratic in first derivatives, namely 


1 
tag = —Oyh™ (Ouheu t+ Ophap — Onhap) + zeh Ooh (IV.11.21) 


1 
+ 59° hp (Gahan + Oghar — ahap) + NK hey — ONLI, HB. 


IV.11 Problems 


The tag are the components of the Landau—Lifshitz pseudotensor. 
This was considered by Landau and Lifshitz as representing the 
stress—energy of the first-order perturbation h. 


IV.11.8 High-frequency waves from a spherically 
symmetric star 


Take as background the spherically symmetric Vaidya metric, which 


in the coordinates xz? = u, zt =r, (x4) = (0, ¢) reads 


g:=-— h - i (du)? — 2dudr + r° (d0? + sin? 0 dd’). 


r 


1. Check that the Vaidya metric coincides with the Schwarzschild metric 
(see Chapter V) when m is constant. Write down its contravariant 
components. 


2. Write the conditions that must be satisfied by the high-frequency 
wave 


g(x,wu) = g+w'o(x,wu), tw w(x, wu), with x := (u,7,6,¢) 
for it to be an asymptotic solution of the vacuum Einstein equations 
with phase u 

(a) of order zero; 


(b) of order 1, in a gauge such that vp; = 0. 


3. Show that the first term v in the wave falls off at infinity like r~!, 

as the background metric. Show that the second term w falls off like 
—2 
PA 


Solution 


1. Elementary computation gives 


2 
Detg = — h — =i r* sin? 0, 
r 
4 an2 
00 OA 1A or sn’d yy 
g g g » g Detg 9g 


2. The coordinate function u is isotropic for g because g““ = 0. The 
gradient of the phase p= u = x° is yy = 02. 


(a) For a solution of order zero, the polarization conditions 


and 
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must be satisfied; that is, using previous relations, 


i 10 AB 
g Via = g Viot Gg VAB, 


1 1 
g^PvaB = -3 (ne + > vas) = —v]0. (IV.11.22) 
r sin^ 0 


To have an asymptotic wave of order one, we must have R =0. 
If we choose the gauge such that vio = 0, then the propagation 
equations, R = 0 are found to reduce to 


1 
OWAB = plas = 0. 
These equations integrate to 


VAB (x, £) = ryaB(u, 0, Q, £): 
We set 


Yo2 =: &, %23 =: p. 
The polarization conditions are then equivalent to 
5 = —Q sin? 8. 


To have a progressive-wave solution of order one of the vacuum 


Einstein equations, it remains to satisfy the equations RY) = 0. 


The equations RO = 0 are found to be 


1 


R = =z _ 0, 


a re TEE reaa i 
Roy = -=w 4 - (aa + 2g os + 20' 0 ; 


1 cos @ y 
RO = —-w", 4 023’ — za’ + = 0. 

a Oe 28 3 sin 0 B 
We see that, given a and 8 uniformly bounded functions of € as 
well as their primitives with respect to €, these linear equations 
in wi, also linear in a’ and 8’, are satisfied by functions w{; such 
that there exist functions w4; that are uniformly bounded in £. 


It remains to satisfy the equation R®) = 0, which reads 


2 n 2 
1 giw! 4 1 a24 a _ 1 a4 - 2 dm 
2| VY re sin“ 0 2r? sin“ 0 r? du 
(IV.11.23) 


This equation can be satisfied by a w uniformly bounded in € only 
if the right-hand side is also the second derivative of a function 
uniformly bounded in €. This will be the case if 


Happi =0 (IV.11.24) 
a\° Taimo) du _ 


for example, by assuming dm/du < 0 and setting u(u) := 
2(—dm/du)?, 


a= p(u)sin€sind, 8 = u(u) cos ¿sin 8 cos 6. (IV.11.25) 


One finds bounded w,; satisfying all the required equations: 


w12 = pe cos € (cos 0 + 2), (IV.11.26) 
w3 = we) sin € sin? 0, (IV.11.27) 

and 
wan = 0. (IV.11.28) 


Remark IV.11.3 The asymptotic metric obtained is axially sym- 
metric (the coefficients do not depend on vy), but not spherically 
symmetric. It is conjectured and partially proved that, in agree- 
ment with physical intuition, no progressive gravitational wave with 
spherically symmetric background is spherically symmetric. 


3. This is left to the reader. 


IV.11.9 Static solutions with compact spacelike 
sections 


Show that in dimension 3+1, the only locally static solution of the Ein- 
stein equations with compact spacelike sections and source such that 
Poo = 0 is vacuum and locally flat. 


Solution 


The Einstein equations for a locally static spacetime metric are 
Rij — YV = poi, pio =0, PAgt = poo. (IV.11.29) 


If M is a compact manifold (without boundary) and Ag% = V;0"y, then 
it holds that 


0 = Agvug =) YT poolig, hence poo =0 ify>0 and poo = 0. 
M M 


Therefore, Agy = 0. On a compact manifold, this implies Y = constant, 
and the equations imply Ricci(g) = 0. We know that Ricci-flat three- 
dimensional manifolds are locally flat. 


IV.11 


Problems 
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56 Arnowitt—Deser—Misner. 


IV.11.10 Mass of an asymptotically Euclidean 
spacetime 


The ADM”? mass of an asymptotically Euclidean manifold (M, g) is 
defined by a flux integral through the sphere at infinity of its end: 


T i 
MADM := Tör jim oa (0:91; as Oj Gii)N du. 


r 


Consider the asymptotically Euclidean metric on R? Ar > 0, 
r =r +y t, 


4 
(1 4 z) (da? + dy? + d2?), 
2r 


isometric with the space part of the Schwarzschild metric in standard 
coordinates (see Chapter V). 
Show that its ADM mass is equal to m. 


Solution 


For the given metric gi; = (1 + m/2r)*6;;, 6;; = 0 for i Æ j, 
m\3 m xi m\3 m xi 
dig =4 (1+ 2) | ðjga=12(1+ Z) 
Jij 2r ( 2r2 -) ig 2r 2r2 r 


3 Jj ; : 
(Omg — OjJii) =8(1+ ~~) (z a ) Se Sig 


2r 2r2 r r3 r2’ 


and hence, with nin; = 1 and 4r the surface of the unit sphere $°, 


1 i 
MADM = Tgr L (ðigij — O;9u)n* du = m. 


IV.11.11 Taub Lagrangian 


Consider the Lagrangian with constraint g(u,u) = —1 and Lagrange 
multiplier À: 


g= / {R(g) + [utp tapa ia 


Prove by choosing coordinates adapted to the fluid flow that the Euler 
equations for such a Lagrangian lead, with the choice \ = p, to the usual 
Euler equations for perfect fluids and the Einstein equations with source 
their stress—energy tensor. 


The Schwarzschild 
spacetime 


V.1 Introduction 


In 1916, soon after the publication of Einstein’s equations, an exact 
solution was constructed by Schwarzschild that could model the gravi- 
tational field outside a spherically symmetric isolated body such as the 
Sun. It was shown that, in first approximation, the relativistic planetary 
trajectories, i.e. timelike geodesics of the Schwarzschild metric, coincide 
with the Kepler orbits. However, in the case of Mercury, the planet near- 
est to the Sun, an additional advance of its perihelion of about 42” per 
century beyond the Newtonian calculuation was found,! in agreement 
with astronomers’ observations. Other effects predicted by the Einstein- 
ian gravitation theory, namely deflection of light rays, redshift, and time 
delay, have also been found to be in remarkable agreement with obser- 
vations and experiments (see Sections V.8 and V.9). In Chapter VI, we 
will treat a revolutionary property of Einsteinian spacetimes, the exist- 
ence of black holes, which first appeared through investigations of the 
Schwarzschild metric. 


V.2  Spherically symmetric spacetimes 


We start with the following natural and elementary definitions. 


Definition V.2.1 A three-dimensional Riemannian manifold (M, g) is 
said to be spherically symmetric if 


1. The manifold M is represented by one chart (U,®) with ®(U) = R? or 
the exterior of a ball B of R? centred at some point O. We denote by 
p, 0, spherical (pseudo) coordinates in ®(U), linked to the canonical 
coordinates x,y,z of R? by the usual relations 


x=psinésing, y= psinĝ cos, z = pcosé. (V.2.1) 


2. In (U), given by p > p >0,0< 0< 27,0 < p < 27, G is 
represented by a metric of the form 


edp? + f?(p)(db? + sin? 6 dy’), (V.2.2) 


V.1 = Introduction 105 
V.2 Spherically symmetric 
spacetimes 105 
V.3 Schwarzschild metric 107 
V.4 Other coordinates 108 


V.5 Schwarzschild 
spacetime and event 
horizon 111 


V.6 The motion of the 
planets and perihelion 


precession 112 
V.7 Stability of circular 
orbits 117 


V.8 Deflection of light rays 118 
V.9 Redshift and time 


dilation 121 
V.10 Spherically symmetric 
interior solutions 122 


V.11 = Spherically symmetric 
gravitational collapse 127 


V.12 Problems 132 


1t Actually, this was first obtained by 
Einstein, in November 1915, by a per- 
turbative calculation where he solved 
the Einstein equations by successive 
approximations, going to second order 
inG E. 
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?Equivalently, the lapse and the shift of 
the slicing (see Chapter VIII). 


with f a monotonically increasing function of p. The interpretation 
is that ®(U) is foliated by metric 2-spheres p = constant, centred at 
O; their areas in the metric (2.2) are 4r f?. The metric (2.2) is the 
general form of a metric invariant under rotations in RÌ, centred at 


O. It is defined on the whole of R? if the ball B is empty. 


Remark V.2.1 In the preceding definition, p = 0 represents, by def- 
inition, a single point O. The vanishing of f7(0) does not imply a 
singularity in the metric, but reflects the fact that spherical ‘coordinates’ 
are not admissible coordinates at p = 0. 


The choice of the coordinate r given by r = f(p) is called the 
standard choice. 


Definition V.2.2 Consider a spacetime (V,g) with V contained in the 
product R? x R, a point of V being labelled (x,t). Suppose the subsets M, 
of constant t are spacelike submanifolds; then we denote by gą the Rie- 
mannian metric induced by g on Mı. The trajectories of the vectors 0/Ot 
are supposed timelike. The spacetime is said to be spherically symmetric 
if the following hold: 


1. Each manifold M, has a representation as the exterior R? — B; of 
a ball B, of R? centred at the origin O. Each manifold (Mz, gt) is 
spherically symmetric. In R? — B, the metric gi reads, in standard 
coordinates, 


gi = 8) dr? + r? (d8? + sin? 0 dy’). (V.2.3) 


2. For each t, the g length and the representative of the projection on My 
of the vector 0/Ot tangent to the timeline? are both invariant under 
the rotation group defined above. 


Lemma V.2.1 A spherically symmetric spacetime (V,g) admits a 
metric of the form 


g = —e” dt? + eò dr? + r?(d6? + sin? 6 dy’), (V.2.4) 
where À and v are functions of t and r only. 


Proof. A scalar on R? — B invariant under the rotation group is neces- 
sarily a function of r and ¢ alone. A vector field invariant under this 
group is tangent to the radial lines (lines where only the r coordinate 
varies) and its magnitude depends only on r for each t. Therefore, the 
given definition implies that 


g = —a?(r, t) dt? +2b(r, t) dt dr+e* dr?+r?(d?+sin? 0 dy”). (V.2.5) 


We can eliminate the diagonal term in dt dr by computing an integrat- 
ing factor for the 1-form in two variables w := a dt — bdr, i.e. a function 
of t and r, which we denote by e~?”, such that its product by the 1-form 
is the differential of a function rT: 


ew =e ™(t,r)(adt — bdr) = dr. 


We take 7 as a new time coordinate, keeping r,6, as space pseudo- 
coordinates. Writing the metric (2.4) with this new time coordinate, 
renamed as t, gives the formula (2.3). The coordinates t,r are called 
standard coordinates. a 


Exercise V.2.1 Given a and b, determine v. 


Remark V.2.2 If the point with coordinate r = O belongs to the 
manifolds M,, it describes a timelike line called the central world line. 


V.3 Schwarzschild metric 


We will prove the following theorem: 


Theorem V.3.1 A smooth spherically symmetric metric is a solution 
of the vacuum Einstein equations if and only if it is the Schwarzschild 
metric, which reads, in standard coordinates, with m a constant 


-1 

JSchw = (1 x) dt? 4 (1 m) dr? + r?(d6? + sin? 6 dy). 
r r 

(V.3.1) 


Proof. We set t = 2°, r = xt, 0 =27,y = x? and denote by a prime the 
derivative with respect to r. We find that the only non-zero Christoffel 
symbols of the spherically symmetric metric (2.4) are 


1 v— v' OÀ = Or 
Poo = go Too = 38r’, Por = ga rì = cS àY, To = D? 
(V.3.2) 
ri as T3 =—re, Tt r? rt, TL, = —r sin? 8 e7> 
11 9? 22 — ; ja = 413 = j 33 = ; 
(V.3.3) 
Pes = — sin ĝ cos 0, Fe = cot é. (V.3.4) 


Computing the components of the Ricci tensor, we find 


Rio = roA, Rə2 = =e [1 + 5 = a’) + I; R33 = sin? 0 Roo. 
(V.3.5) 


We deduce from these identities that for a solution of the vacuum 
Einstein equations, Rag = 0, À is independent of t. Therefore, v’ is also 
independent of t, Ov’ = 0, and v is of the form 


v(t,r) =v(r) + f(t). (V.3.6) 


We set T =f e2/(” dt and we rename 7 as t; we then have Ov = 0. The 
other non-identically zero components of the Ricci tensor then reduce to 


i 12 FAT / 
ANY v V'À v 
Roo -i ( 2 + 4 4 + = ) ; (V.3.7) 
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V 
=a p2, (V.3.8) 


The vacuum Einstein equations therefore imply 
r(eè™” Roo + Ru) =v +’ =0. (V.3.9) 
Modulo this relation, the equations Roz = 0 and R33 = 0 reduce to 


—eA(1—rW)+1=0; 


that is, 
(ey + — =-, (V.3.10) 
r r 
The general solution of this linear equation for e~* is 
A A 
e*=1+ —, hence e” = B (1 + 2) : (V.3.11) 
r r 


with A and B arbitrary constants. The constant B can be made equal 
to 1 by a rescaling of t. The constant A is denoted by —2m; we will 
see in Section V.6, Equation (6.8), that the Newtonian approximation 
corresponding to the Schwarzschild metric then coincides with Newton’s 
gravity, with the coefficient m being the gravitational mass expressed in 
units of length. We have thus obtained the metric (3.1). 

To show that this metric satisfies the full vacuum Einstein equations, 
we must check that it satisfies also the equations Rog = 0 and R11 = 0; 
this can be done by using (3.9) and (3.7), (3.8), (3.11) to show that 
eò” Roo — Ry, =0. E 


Exercise V.3.1 Prove this last statement by using the Bianchi iden- 
tities. 


Remark V.3.1 We have supposed in this theorem that the areas of the 
orbits of the symmetry group are monotonically increasing along their 
orthogonal trajectories. 


In the course of the proof, we have obtained Birkhoff’s theorem: 


Theorem V.3.2 (Birkhoff) A smooth spherically symmetric metric 
solution of the vacuum Einstein equations is necessarily static. 


V.4 Other coordinates 


In some problems, it is useful to use alternative, non-standard, coordin- 
ates for the Schwarzschild metric. 


V.4.1 Isotropic coordinates 


One defines new coordinates X,Y, Z, called isotropic, on R3 x R that 
are related to the standard r,0,¢@ (i.e. x,y,z) by setting 


m\2 
=R(1 z) Ad 
r R( Foa) (V.4.1) 
X = Rr tz, Y=Rr-'y, Z= Rrtz. (V.4.2) 


In terms of these coordinates, the metric reads, with R? = X? +Y? +22, 


2R- m . m\4 
= dt? + (14 dX? + dY? 4+dZ?). (V.4. 
tsew =- (Bra) a+ (1455) (aX? AY? + dZ). (VAS) 


Exercise V.4.1 Prove this formula (see the solution of Problem 
V.12.4). 


It is clear from the above expression that the spaces t = constant of 
the Schwarzschild metric are conformal to Euclidean space. 


Remark V.4.1 ris a monotonically increasing function of R, from 2m 
to infinity, when R increases from m/2 to infinity; r and R are equivalent 
for large r. 


V.4.2 Wave (also called harmonic) coordinates 


Theorem V.4.1 The 3+4+1 metric,’ defined for 7 >m, 
POM yo ETM gee (F + m)? (d0? + sin? 6 dd”) (V.4.4) 


r+m T—m 


is isometric to the Schwarzschild metric by the mapping r=7-+m. The 
corresponding Cartesian coordinates t and xt defined by 


x! =FsinOsind, x? =FsinOcos¢, x? =Fcosé 


are wave coordinates. 


Proof. We write an arbitrary spherically symmetric, static, metric on 
R? x R in the form 


— A? dé? + B? dr? + r?(dé? + sin? 0 d’), (V.4.5) 


where 0, ¢ are spherical coordinates on 8°. The coefficients A, B, and r 
are functions of 7 only. The coordinate t is obviously a wave coordinate. 
We look for a function r = f(7) such that the coordinates x’ defined by 
the above relations are wave coordinates, i.e. such that the functions zt 
satisfy the wave equation. For an arbitrary function w, the wave equation 
in the metric (4.4) reads 


e | >: (an=) +a], (V.4.6) 


A2 ðt? ` r2 | ABF 
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3Note that while the standard form of 
the Schwarzschild metric generalizes to 
higher values of n, its expression in 
wave coordinates does not (see Prob- 
lem V.12.5). 
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4Pretorius (2005a). 


where A* is the Laplacian on the sphere 9°. If x? is one of the functions 
given above, then 


ðr? r? ori 


s = A*tg* = —22". V.4.7 
OF F 0 and x x ( ) 
Hence the condition that the x’ be wave coordinates reduces to 
1 d 
— — (ABr?) —2F =0. A, 
AB aR r*) —2r=0 (V.4.8) 


In the case of the Schwarzschild metric, we have for r > 2m 


2 d 
Az=ii-™, Baa, (V.4.9) 
r dr 
and hence the harmonicity condition is 
d (dF 5 5 n difdr, ys 
= — 2r = .4.1 
a (Za ) 2 a É (rf — 2mr) r=0 (V.4.10) 
We set 
r=m(1+2), (V.4.11) 
and (4.10) becomes 
d 5 dr 
= 1. 4.12 
I E DZ 2z=0, for z> (V ) 


This linear second-order differential equation is a Legendre equation, 
whose general solution is, with C1 and C2 arbitrary constants 


F=Ciz+Cy (in i). (V.4.13) 


To avoid introducing an additional logarithmic singularity on the horizon 
r = 2m, we take 


F=omz, ie r=r+m. (V.4.14) 


Using this value of r, we can check that the Schwarzschild metric in the 
wave coordinates x’ defined above is the given metric. E 


V.4.3 Painlevé—Gullstrand-like coordinates 


The following stationary but non-static form of the Schwarzschild metric 
has been used in numerical computations:+ 


2 2 [2 
(1 z) dt? + dz? + dy? + d2? + =4/ M (a dx + ydy + zdz) dt. 
r r\ r 
(V.4.15) 


It is called a boosted Schwarzschild metric. 


V.5 Schwarzschild spacetime and event horizon 111 


V.4.4  Regge—-Wheeler coordinates 


We define, in the region r > 2m, a tortoise radial coordinate 
p=r+2mlog(r — 2m). (V.4.16) 


The metric then takes a form in which the timelike sections 0 = constant, 
ġ = constant are conformal to two-dimensional Minkowski space: 


2 
9Schw = (1 — m) (—dt? + dp?) + r?°(d0? + sin? 0 d?),  (V.4.17) 
P 


where r is the function of p defined by (4.16). 


V.5 Schwarzschild spacetime and event 
horizon 


If m = 0, the Schwarzschild metric reduces to the Minkowski metric; 
if m Æ 0, it is singular for r = 0 and has a coordinate singularity for 
r = 2m (see below). 

The sign of the constant m Æ 0 is very important in determining the 
properties of the Schwarzschild metric. 

If m < 0, the Schwarzschild metric defines a spherically symmetric 
spacetime on the whole of R? x R, except at {0} x R, where the met- 
ric is singular. At present, the Schwarzschild metric with m < 0 has no 
physical interpretation. It can be shown that there exists no asymptotic- 
ally Euclidean Einsteinian spacetime (R° x R, g) with sources of positive 
energy that coincides for r > a > 0 with a Schwarzschild metric with 
m < 0; this is a particular case of the positive-mass theorem.” 

If m > 0, the Schwarzschild metric is a regular Lorentzian metric with 
t timelike and r spacelike and r > 2m; 2m is called the Schwarzschild 
radius. For r large with respect to m, we will see in the following sections 
that the resulting physical properties coincide in first approximation to 
those of the Newtonian theory for a spherical body of gravitational 
mass m centred at r = 0. 

It is possible to construct so-called interior Schwarzschild solutions 
that are spherically symmetric spacetimes, not necessarily static, smooth 
on a manifold R x RÌ N {r < a}, satisfy the Einstein equations with 
sources of positive energy, and lead to a complete admissible Einsteinian 
spacetime on the whole manifold Rx R3, with, for r > a, a Schwarzschild 
spacetime of mass 2m < a (see Section V.10). 

Since 1 — 2m/r vanishes for r = 2m, the Schwarzschild metric in 
standard coordinates appears to be singular there: goo vanishes and grr 
becomes infinite. For r < 2m, r # 0, the Schwarzschild metric is again 
a regular Lorentzian metric, but the timelike and spacelike character of 
the coordinates t and r are interchanged. 

We will construct later, whatever the value of a > 0 , vacuum Ein- 
steinian spacetimes that are regular for r > a > 0 and isometric for 


5See, for instance, YCB-OUP2009 and 
references therein. 


®Recall that the gravitational and in- 
ertial masses differ by a factor equal 
to the gravitational constant. In this 
book, unless otherwise stated, we work 
in units such that this constant as well 
as the speed of light are equal to 1. 
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TFor the Sun, 2mgun = 2.96 km and for 
the Earth, 2MEarth = 8.87 mm. 


r > 2m to a Schwarzschild spacetime in canonical coordinates with ra- 
dius 2m. They are interpreted as spacetimes modelling the exteriors of 
bodies with Schwarzschild radius 2m. However, the apparent ‘Schwarz- 
schild singularity’ r = 2m > a has a deep physical meaning that we will 
discuss in Chapter VI: no light ray or other classical signal (i.e. one not 
due to a quantum effect) can escape from the regions r < 2m. For this 
reason, the hypersurface R x {r = 2m} is called an event horizon and 
a spacetime with source of radius a < 2m is called a black hole (see 
Chapter VI). 

If a is identified with the radius of the body in a classical CGS length 
unit and the gravitational mass m determining the Schwarzschild radius 
is expressed in the same unit of length, but with the units of time and 
mass chosen so that the speed of light and the gravitational constant are 
both equal to 1, as we do in writing the equations (see Problem III.9.3 
in Chapter III), then the Schwarzschild radii of the solar planets and 
normal stars are very much inside these bodies:’ 2m << a. 

In the rest of this chapter, we will study properties of Schwarzschild 
spacetimes in the region r > 2m. We will compare their predictions with 
the Newtonian ones and unravel new effects of gravitation predicted by 
Einstein’s General Relativity. We will also quote results of observations 
and experiments, all in agreement with Einstein’s theory. 


V.6 The motion of the planets and 
perihelion precession 


V.6.1 Equations 


The trajectories of bodies of small size and mass in a spherically symmet- 
ric gravitational field, for instance the trajectories of the solar planets, 
are timelike geodesics of this field, i.e. of the Schwarzschild spacetime. 

We denote by ds the element of proper time on a timelike curve; i.e. 
with our signature convention, 


ds? = —gagdx*dx*. (V.6.1) 
As in the Newtonian case, we find that the orbits remain in a ‘plane’ 
of R? by considering first the equation (cf. the expressions for the 
Christoffel symbols) 
O 2drdéb dy\? 
— + —— —sinĝcosð | = ) =0. 
ds? k rdsds "7790S (2) 
We choose the coordinate 0 such that at some initial instant sọ we have 
for the considered motion of the planet 0(s9) = 7/2 and (d@/ds)(s9) = 0. 
The equation satisfied by 0 then implies that the orbit remains in the 
‘plane’ 6 = 7/2. With this choice of the coordinate 0, the equation for 
p reduces to 


(V.6.2) 


de 2drdy 
de? + ame 0, (V.6.3) 
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which integrates to an analogue of the Newtonian area law, with £ some 
constant such that 


d 
e£ y, (V.6.4) 
ds 


Remark V.6.1 Equation (6.4) is a consequence of the invariance of 
the metric under rotations. The constant £ can be interpreted as the 
angular momentum per unit mass, as seen at large distance. 


The equation 


dt — dvdr dt 2m 
— + ——— = ith ‘= | 1 — — .0. 
72 + T ds d; 0, wi v os ( 7 ) (V.6.5) 
integrates to (‘energy’ integral due to t-translation invariance) 
2 dt 
(1 — m) — = E, Ea constant. (V.6.6) 
r } ds 


The remaining geodesic equation is 


2 2 is 2 
d?r l l1dà (dr a dy | e dv [ dt l (V.6.7) 
ds? 2dr \ds ds 2 ds \ds 

Using the expressions for À and v together with the previous integrals 

(6.4) and (6.6), we see that, when £ 4 0, this equation reduces to 


du m 


1 
de +3mu?, with u = ms 


(V.6.8) 
This equation is formally the same as the linear equation in u of New- 


tonian mechanics, except for the addition of the nonlinear term 3mu?. 
Therefore, it holds that 


u(y) = UNewton (P) F v(e), (V.6.9) 
where v satisfies the equation 


dv 2 
As is well known, the solutions of the Newtonian equation (for 4 4 0) 
are the conics 


L = ml? (1 +e cos p) =: UNewton; (V.6.11) 
where e is the eccentricity and where the longitude of the perihelion, a 
constant in this Newtonian case, has been taken to be equal to zero. The 
constant e depends on the initial position and velocity of the planet. It 
is not far from 0 for the solar planets—their orbits are almost circular. 

For the solar planets, r is large with respect to £, and hence 3mu? 
can be considered as a small correction to mé~?; the correction v will 
therefore be small with respect to uNewton. An approximate solution of 
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(6.11), a small correction to the Newtonian expression, is obtained by 
replacing (6.11) by the linear equation 

dv 2 — 3,3 9—4 ey 

IF + V = 3MUNewton = 3m°L “(1 + 2e cos p + e“ cos“ p). (V.6.12) 
This is the differential equation for a forced oscillation. The general 
solution is the sum of the general solution of the associated homogeneous 
equation, i.e. an arbitrary periodic function of period 27 in ọ, and a 
particular solution, for instance 


1 1 
3m3é-4 |1 + ey siny + e? (5 — 5008 2) : (V.6.13) 


The term that will give the observationally most significant contribution 
to the correction is the ‘secular’ (non-periodic) term in y; that is, we 
consider the Einsteinian approximation: 


UFinstein ~ UNewton T 3ml teg sin p 
= me-*(1+ecosy+e3m7l"ysiny).  (V.6.14) 


In geometric units, m?0~? is small, and therefore 3m7?¢~7y is equivalent 


to sin(3m?é-7~p) and uginstein is equivalent to 
URinstein ~ MLT? {1 + ecos[(1 — 3m7£-2)y]}. (V.6.15) 


The orbit is no longer a closed curve (unless it is circular, i.e. if e = 0), 
because Upinstein does not have period 27 in y. It is only approximately 
an ellipse; the closest point to the centre (perihelion), attained for y = 0, 
is attained successively after each increase of y by 27(1 — 3m?0-?)-1~ 
Qn + 6rm?l-?. The additional 67m?é-? is the famous Einsteinian 
perihelion precession. 


Remark V.6.2 Elementary calculus gives a perihelion precession per 
orbit 


67m 2 
—, p:=a(l — e°). 
P 
p is called the parameter of the Newtonian ellipse, a is its semi-major 
axis, and e its eccentricity. For equivalent a’s, the precession of the 
perihelion is greatest for eccentricities approaching 1. 


V.6.2 Results of observations 


It was observed long ago by astronomers that the orbits of the solar 
planets are not exact ellipses, but slowly rotating ones. This phenom- 
enon was interpreted in Newton’s theory as a result of the influence of 
other planets. The perihelion precession of most planets could thus be 
accounted for—except for that nearest the Sun, Mercury, for which a 
precession of 42” per century over the 5600” observed (found by Lever- 
rier in 1845) remained unexplained. The Einsteinian correction (1916) 
just filled the gap, which was a remarkable success for the new theory. 
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For a long time, it was thought that the not exactly symmetric shape of 
the Sun could also play a role but was too difficult to estimate,® but re- 
cent data have estimated the quadrupole moment of the Sun and shown 
its contribution to Mercury’s perihelion advance to be quite small. 

In 1974, a pulsar (a rotating neutron star, PSR 1913+16) was observed 
by Taylor and collaborators, orbiting around a companion that is most 
probably also a neutron star. Its orbit shows a precession of about 4.2 deg 
per year, which is about 27100 times the precession rate of Mercury. 
This is believed to be an Einsteinian effect, but since the masses of the 
orbiting objects are not known, the precession cannot be used directly 
to test General Relativity; it is used instead to estimate these masses. 
To do this, one must generalize the above treatment of the dynamics of 
a small mass around a large one to the study of the relativistic dynamics 
of two comparable masses. 

In order to compare the predictions of General Relativity with the 
observations of binary pulsars, it is necessary to go beyond the first post- 
Newtonian approximation presented in Section IV.5.3 in Chapter IV. 
First, as a neutron star is an object that generates at its surface a very 
strong deformation of the Minkowski metric (gẹ ~ —1 + Agurface, 
with hège ~ 2GM/c?R ~ 0.4 for a neutron star, to be compared with 
ngefece(Sun) ~ 1076 and hg§yrf°e(Earth) ~ 1078), one needs to develop 
approximation methods that go beyond the usual weak-field expansion 
and are able to deal with the motion of strongly self-gravitating bodies. 
Such methods, based on the multichart approach mentioned in Sec- 
tion IV.5.3, were developed by several authors in the 1970s and 1980s.° 
These approaches use the method of matched asymptotic expansions 
to transfer information between the tidal-like expansions of the metric 
Gâ (X1) considered in the local coordinate system X{ attached to each 
body (labelled by A) and the post-Minkowskian (or post-Newtonian) 
expansion of the metric g,,(x*) considered in the global coordinate 
system z”. In addition, it was necessary to push the accuracy of the 
post-Newtonian expansion used in the global chart x” much beyond the 
IPN level. More precisely, it was necessary to go to the 2.5PN level, 
corresponding to keeping terms (v/c)° smaller than the Newtonian-level 
terms. The first complete results in the 2.5PN approximation were ob- 
tained in the early 1980s.!° At this level of approximation, the fact that 
the gravitational interaction between the two bodies propagates (via a 
retarded Green function of the wave equation) at the velocity of light 
entails an observable effect: namely a slow decrease of the orbital period 
P, given by?! 


dP, 1927 531+ Bet Bet 
(GMa an 


— =y V.6.16 
dt 50 ( ) 
where v = mamp/(ma + mpg)? is the symmetric mass ratio, M = 
ma + mpg the total mass, Q = 27/P, the orbital frequency, and e 
the eccentricity of the orbit. This effect has been observed in several 


8In the case of satellites orbiting the 
Earth, the irregularities of their mo- 
tions are used to determine the shape 
of the Earth. 


See, notably, D’Eath (1975) and 
Damour (1983a). 


10See Damour and Deruelle (1981) and 
Damour (1982). 


11 See Damour (1983b). 
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12See Damour and Deruelle (1985, 
1986). 


13For a review, see, for instance, 
Damour (2013b). 


binary pulsars, with a magnitude in precise agreement with the Gen- 
eral Relativity prediction (6.16). As this effect is a direct consequence 
of the propagation of the gravitational interaction at the velocity of 
light, it constitutes an observational proof of the reality of gravitational 
radiation. 

In addition, the development of the relativistic theory of the ‘tim- 
ing’ of binary pulsars!? has allowed several other comparisons between 
the predictions of Einstein’s theory and binary pulsar data. More pre- 
cisely, the measurement of n ‘post-Keplerian’ parameters appearing in 
the Damour—Deruelle timing formula allows one to extract n — 2 tests 
of General Relativity (or alternative theories of gravity). For instance, 
in the case of PSR 1913+16, one could use pulsar timing data to meas- 
ure the following three ‘relativistic’ or ‘post-Keplerian’ observables: (i) 
advance of periastron, (ii) apparent modification of the proper time of 
the pulsar by combined Doppler and gravitational effects, and (iii) rate 
of change of the orbital period of the pulsar linked to gravitational radi- 
ation damping. Comparison of these three ‘post Keplerian’ observables 
with their General Relativity prediction leads to an accurate (107° level) 
confirmation of Einistein’s theory in a regime that involves both strong- 
field effects (in the neutron star) and radiative effects. Other binary 
pulsars!’ have recently allowed further accurate confirmation of GR. 


V.6.3 Escape velocity 


As an exercise in the physical interpretation of General Relativity, we 
compute the radial velocity with respect to an observer at rest in the 
Schwarzschild metric that must be applied to a test object for it to 
escape the gravitational attraction. 

Let ro be the r coordinate of the static observer in a Schwarzschild 
spacetime with mass m < ro/2. Denote by řo = dr/ds(0) the proper- 
time initial velocity, supposed to be radial, of the launched rocket at 
parameter time t = 0. This rocket, supposed to be in free motion after 
its launch, follows a radial geodesic curve and hence satisfies, by (6.6), 
the equation 


(1-*)i-e- (1-=) to. (V.6.17) 


We also have, as a result of the definition of ds, 


=i 
1=(1 me (1 =) P, (V.6.18) 
r r 


Hence 


tsi: (V.6.19) 


The rocket can attain a maximum of the parameter r (and then turn 
back) when 7 = 0, that is, when r takes the value 


o 2m 
1- E2?' 


The number rm is an attained maximum of the parameter r if it is 
positive and finite, that is, if Æ < 1. The escape velocity for which rm 
is infinite corresponds to Æ = 1, and hence, by (6.16), to 


TM (V.6.20) 


S d (V.6.21) 
ro 
The relativistic escape velocity V for the observer at rest is given by 
the ratio of the space (radial) and time components V! and V° in the 
proper frame of this observer of the velocity vector, which has compo- 
nents (79,9) in the natural frame of the coordinates t,r. The proper 
frame of the static observer is 


1 1 
2 2 2 -3 
po = (: = =) dt, o= (1 . =) dr, (V.6.22) 
ro ro 
and therefore 
1 
2 72 2 2. 
vi= (1 2 =) tp, VO= (1 = =) fo. (V.6.23) 
TO TO 
Hence, using (6.20) and (6.18) when E = 1, we obtain 
Vv! 2m 
V =: — = 4| —. V.6.24 
a (V.6.24) 


This escape velocity coincides with its Newtonian value. It tends to 1, the 
velocity of light, when ro tends to 2m, in agreement with interpretations 
that we will give in a forthcoming section. 


V.7 Stability of circular orbits 

To study the stability of circular orbits, we use the identity 
Jape? = —1 (V.7.1) 

and (6.4) and (6.6) to obtain 


i, i am\ [2 ly 
ag bt ee. 72 
2" 5 ") (4 ) 2 tee) 


This type of differential equation occurs in classical mechanics and 
governs the motion of a particle of unit mass and energy 1p? in the 


potential!4 
1 2m L 
Vin =5 (1 : ) (5 | 1). (V.7.3) 
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14Our consideration of the dynamics of 
a test particle by means of an effective 
potential can actually be generalized 
to the case of the motion of a binary 
system of comparable masses by us- 
ing the ‘effective one-body’ approach 
to two-body systems (see Buonanno 
and Damour, 1999; Damour and Nagar, 
2011). 
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By differentiating (7.3) with respect to s, we obtain the following second- 
order differential equation for r 


r+—=0. (V.7.4) 
A circular orbit r = ro is one for which è = 0; hence 7 = 0 and 


(dV /dr)(ro) = 0. That is, for such an orbit, the potential V has a critical 
point. Computation gives 


—=r 4 (mr? —0r+3ml?). (V.7.5) 


The critical points are therefore given by 


24 /F—12Pm? 
= T; (V.7.6) 


2mr2 


For l? < 12m?, there is no circular orbit. For ¢? > 12m?, there are two 
possible circular orbits: ro = R, and ro = R_. The circular orbit r = ro 
is stable if the critical value ro is a minimum of V; it is unstable if ro 
is a maximum. Indeed, linearization around ro of dV /dr shows that the 
equation (7.4) leads to oscillations of r around ro if (d?V/dr?)(ro) > 0 
and exponential growth of r if (d?V/dr?)(r9) < 0. Elementary calculus 
shows that R— is a maximum of V and hence the orbit rọ = R_ is 
unstable, while R4 is a minimum and hence rp = R+ is stable. 

We see from (7.6) that, given m, the smallest possible value of R is 
6m, while the smallest possible value of R— (obtained when £ tends to 
infinity) is 3m. We have proved the following theorem: 


Theorem V.7.1 In a Schwarzschild spacetime of mass m, there is no 
circular orbit with angular momentum £ less than my12. The Schwarz- 
schild coordinates r of circular orbits all satisfy r > 3m. The last stable 
circular orbit has r > 6m. 


V.8 Deflection of light rays 


V.8.1 Theoretical prediction 


Light rays are null geodesics, so they are bent by the curvature of the 
spacetime. The differential equations they satisfy are the same as the 
equations for timelike geodesics, except that the derivative denoted by a 
dot is now a derivative with respect to the affine parameter on the null 
geodesic and (7.1) is replaced by 


agtt = 0. V.8.1 
Jog 


That is, using (6.4) and (6.6), we have 


2 
leyi (1 x) "a (V.8.2) 


r2 


We deduce from this equation and from (6.4) that on a null geodesic, 


1 fdr\*? 1 2m , 
4 (=) zo pT k“; (V.8.3) 
with k = £?¢~? a constant. Setting u = r7! gives 
du * 2 3 2 
ae +u’ —2mu° =k’. (V.8.4) 
Hence, after differentiation, we obtain 
d? 
+u = 3m. (V.8.5) 


The term 3mu? in (8.5) is the Einsteinian correction to the following 
equation: 

du 1 
This gives, in the absence of gravitation, straight lines as light rays, with 
equations in spherical coordinates taking the form 


1 1 
= = Usphr(Y) = — cos(y — p0), (V.8.7) 
T To 


where ro is the displacement from the centre. If we consider the Einstein 
correction 3mu? as small, we obtain an approximation to Einstein’s light 
rays by solving the differential equation 
du 
dy? 
Setting yo = 0 for simplicity, we find that the general solution to (8.8) 
is 


+u = 3muk pnr- (V.8.8) 


t= pk “Geen? y). (V.8.9) 
r To ré 
On the straight line given by (8.7), the coordinate r tends to infinity 
when @ tends to —1/2 or 7/2. On the curve given by (8.9), r tends 
to infinity when ¢ tends to +(7/2 + a) with, by (8.9) and elementary 
trigonometry, a satisfying 


—sina+ ™ (1 + cos? a) =0. (V.8.10) 
ro 
That is, if œ is small, 
2 
—— (V.8.11) 
To 


The total deflection of a light ray is therefore estimated to be 
4 
jae (V.8.12) 


ro 


For light rays grazing the solar surface, this angle is about 1.75”. 
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15Ferrarese (2004). 


V.8.2  Fermat’s principle and light travel parameter 
time 


We have said that light rays follow null geodesics of spacetime. The 
following property is a generalization to static spacetimes of a classical 
theorem of Fermat in Newton’s space E’. 


Theorem V.8.1 In a static spacetime with metric 
ds” = —9Joo dt? + gigdx' dx! , 


the projections on space of light rays are geodesics of the Riemannian 
metric 


= gigdu' dx) ; 


2 


do? 
Joo 


i.e. they are relative minima of the integral 


ti Joo 


On a null curve, £ is the parameter-time duration. 


Proof. This is a straightforward computation, using the geodesic equa- 
tions in ds and do. We leave it as an exercise. E 


In addition, we note that Fermat’s principle can be generalized to 
arbitrary spacetimes as follows. 15 


Theorem V.8.2 The ‘proper time of arrival’ of null curves to a given 
timelike curve admits a critical point (is a relative minimum) at a 
light ray. 


V.8.3 Results of observation 


It was first necessary to wait for an eclipse to be able to observe the 
apparent displacement of the stars due to the deflection by the Sun of 
light coming from them. As early as 1919, an expedition was organized 
by Eddington to measure the bending of light by the Sun. The observed 
deflection was in reasonable agreement with the prediction but was not 
very precise, until data from the Hipparcos satellite verified the deflec- 
tion of light to a 107° precision. More precise results have long since been 
obtained with the use of radio waves: it was not necessary in this case to 
wait for an eclipse. Measurements using several radio telescopes at inter- 
continental separations (very long baseline interferometry, VLBI) now 
give a precision of about 1074. They strongly support General Relativ- 
ity, in particular in constrast to the Jordan—Brans—Dicke scalar—tensor 
theory. 


V.9 Redshift and time dilation 


The time dilation and redshift effects are not due to a change of the 
velocity of light in a gravitational field—in General as well as in Special 
Relativity, the speed of light is a universal constant. Rather, the effects 
are caused by dependence on the observer of the measure of the proper 
time (see Chapter III). 


V.9.1 Redshift 


We have already considered the gravitational redshift in Chapter III, in 
any static spacetime. Its measurement in a Schwarzschild spacetime is 
one of the classical tests of Einstein’s equations. 

Suppose for simplicity that the emitting atom and the observer are 
both at rest with the same angular coordinates, and with radial Schwarz- 
schild coordinates r4 and ro, respectively. The emitted period T4 and 
the observed period To are then linked by the relation (see (III.8.5)) 


To = (1 = =) (1 = 2) Ts (V.9.1) 


Hence To > Ta if ro > ra, and so a redshift (smaller frequency v) of 
spectral lines is then observed. If m/ro is small, then 


T 1 1 
Azi -m (- - J (V.9.2) 
A O 


This formula shows that the time between signals emitted regularly in 
its own time by a source located at a point with parameter ro tending 
to the horizon 2m seem progressively longer to a stationary faraway 
observer; it tends to infinity as the source tends to the horizon ro = 2m. 
This effect, already noticed by Oppenheimer and Snyder, is called the 
infinite-redshift effect. 


V.9.2 Time dilation 


To state the elements of the theoretical prediction of time dilation, we 
treat the case of a rocket sent from the Earth in a radial direction with 
a velocity less than the escape velocity and moving then in free fall, i.e. 
following a timelike geodesic of the Schwarzschild metric representing the 
Earth’s gravitational field. The parameter time tm it takes the rocket 
to fall back to Earth is twice the parameter time it takes to attain its 
maximum rm value (see Section V.6.3). We calculate the proper time 
using the formula 
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mtg, 
— r — 
= pe an gt (V.9.3) 


ro 
with 


(2))=6-2)-(-2)" of v 


We set (dr/dt)(0) = v and we obtain for s4, when m/ro is small, the 
approximate expression 


tm 2 
at (1 ARN Ra ) dt. (V.9.5) 
0 


r ro ro 


The proper time observed on Earth at the fixed ro between the origin 
and the impact point is 


s= f” ji- ae f" (1-2) dt. (V.9.6) 


Therefore, a standard clock carried by the rocket shows a delay over the 
same standard clock of the observer, given approximately by 


tm 2 
sa-so= f ( mM gangy ) dt > 0, (V.9.7) 
0 


F ro ro 


since r > ro on the trajectory. Experiments made with caesium clocks 
have confirmed this time delay and its estimates with great precision. 


Remark V.9.1 On a Lorentzian manifold, timelike geodesics are, as 
in Minkowski spacetime, local mazima of the length of timelike curves 
joining two points. In the given example, it is the rocket that follows a 
geodesic, while in the twin paradox (see Chapter II), it is the travelling 
twin who is not in free fall—he has to use an engine to come back. 


V.10 Spherically symmetric interior 
solutions 


A relativistic model for a spherically symmetric isolated star is a spher- 
ically symmetric spacetime (V = R? x R,g) such that g satisfies the 
Einstein equations on V, 


1 
ab T zIapR = Tab, 


where Tag, the source stress-energy tensor, is zero outside the star and 
depends inside it on the star’s physical constitution—this is not well 


Sag = R 
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known and is difficult to model by a single formula. For this reason, 
there is no physically reliable exact solution for the considered problem. 
However, important qualitative features can be obtained from general 
considerations. 


Remark V.10.1 As in Newtonian theory, a spherically symmetric 
Einsteinian gravitational field vanishes in the interior of a hollow 
sphere—that is to say, namely a spherically symmetric vacuum solution 
defined for r < a is necessarily flat in this domain. Indeed, the construc- 
tion of Section V.3 shows that such a solution is a Schwarzschild metric 
(3.1), with m some arbitrary number. The only solution of this form that 
is continuous at r = 0 is the flat solution with m = 0. 


V.10.1 Static solutions. Upper limit on mass 


We look for equilibrium configurations, i.e. static spacetimes. We use 
Schwarzschild coordinates. For r > rstar, with rstar the radial Schwarz- 
schild coordinate of the star’s boundary, we have vacuum; hence the 
solution is Schwarzschild. For r < Tgtar, we again look for a static 
spherically symmetric metric, of the form 


g = —e” dt? + eò dr? + r7(dé? + sin? 6 dy’), (V.10.1) 


but now we solve the equations with a non-vanishing stress—energy ten- 
sor, which we take, in the absence of better modelling, to represent a 
perfect fluid; that is, the Einstein equations are 


Sop = Tap, with Tag = (u + p)tetg + P9ap; (V.10.2) 
equivalently, 
1 
Rag = pag = (U + p)taug + 5(u — P) Jag. (V.10.3) 


For a static solution, the fluid unit velocity is uê = 0, u? = e : 
tangent to the timelines. Then To; = Ro; = 0, 


Too = pe”, Roo = eiu +37); Ty =Peu, Ry = su = p)gij- 
(V.10.4) 
The scalar functions u and p, like A and v, depend only on r. 
The identities found in Section V.3 give (a prime denotes differenti- 
ation with respect to r) 


1 


Roe = 
sin? 0 


R33 = e* [1 t (v A’) t 1, 


r(e*—” Roo + Ri) = yp’ + x; 
and for the spacetime scalar curvature 


R=-e "Rote Ri +r 7(Roo + sin? 6 Rz). (V.10.5) 
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16We will see in Section VIII.7.2 that 
the fact that the component S$ does 
not contain the lapse (here e”) is a 
general property of the Einstein tensor. 


Ly v2 p! l w =N 2y, 
=-e (v's 5 | re aoe (V.10.6) 


Elementary computations give 


1 1 i ai= 
Soo = Roo t zE R = z [Poo t eR + er? [Roo + (sin@) ?Rs3]}. 


The equation Soo = Too = pe” is then found to reduce!’ to a differential 
equation for À that can be written as 


1 r 
1-—(er) =r?°u, hence e~* = - r [1 — p?u(p)] dp + constant 
0 
(V.10.7) 


Smoothness of the metric in a neighbourhood of the origin r = 0 of polar 
coordinates imposes the vanishing of the integration constant; hence 


2M 1 [* 
ee 1= eM) with M(r)= F e ulp) dp. (V.10.8) 
0 


T 


This solution defines a Lorentz metric in a domain r < a only if 
2M (a) = J pPulp)dp <a. (V.10.9) 
0 
Having computed A, we use the equation 


v! +A = r(eè™ Roo + Ru) = r(eò™” poo + p11) = reò(u +p) (V.10.10) 
to find that 


_ 2M(r) +1°(p— u) 
r[r—2M(r)] 


On the other hand, the fluid equation (cf. Chapter IV) gives 


(V.10.11) 


v = (eò — 1)r™! + reò(p — u) 


(u+ p)u”Vauı + Op = (u + p)u’uo h, + Arp = 0; 
that is, 


/ 


1 
p =- +u, 
and we find that 


py , „2M(r)+r?(p- u) 
P =-3Pt OMG) 


(V.10.12) 


This is known as the Tolman—Oppenheimer—Volkov equation of 
relativistic hydrostatic equilibrium. It can be integrated if one assumes 
some equation of state inside the star. In most situations, it is quite 
difficult to know with any reliability an equation of state, because of the 
complexity of the phenomena that occur. However, one can deduce some 
conclusions from this formula. 

When the density u is a constant, uo, the integral (10.8) gives 


i 1 
2M(r) = no | p’ dp = zor”. (V.10.13) 
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Equation (10.12) then reduces to a differential equation for p that can 
be integrated exactly. Assuming that p vanishes at the boundary r = a 
of the star, one finds for the pressure at its centre 


1 — [1 — 2a- M(a)]? 


0) = i V.10.14 
P(0) M031 —2a-1M (a)? —1 ( ) 
This pressure becomes infinite if 
=i 1 $ 4a 
3[1 — 2a M (a)? =1, ie. M(a)= p (V.10.15) 


The pressure p(0) becomes negative if M (a) > 4a/9. 
Exercise V.10.1 Prove these results. 


The classical conclusion is that stars with M (a) > 4a/9 cannot exist. 
The same result holds!” if one supposes only that ju is a non-increasing 
function of r. 


Theorem V.10.1 There exist no equilibrium configuration of spheric- 
ally symmetric stars filled with a perfect fluid with a non-increasing 
function of r and such that 


M(a) > 4a 


Z 10.1 
Bi (V.10.16) 


where a is the standard radius of the star and M(a) is given by 


M(a) = IE dp. 


V.10.2 Matching with an exterior solution 


An Einsteinian model for the exterior and the interior of a spherically 
symmetric star is a manifold R3 x R with a Lorentzian metric g that 
satisfies the Einstein equations on the whole manifold, induces an interior 
Schwarzschild metric in B x R, with B a ball of R? filled with matter, 
and an exterior, vacuum, Schwarzschild metric in the complementary 
domain. The metric, spherically symmetric, reads on R? x R 


g = —e” dt? + & dr? + r°(d0? + sin? 6 dy’, 


with eò and e” equal to the coefficients of the interior metric gstar for 
r <Tstar, denoted by a above, and to those of a vacuum Schwarzschild 
metric in the exterior. For such a metric to be a solution on R? x R, 
some continuity!® properties are required from the functions À and v at 
the boundary r = a between the interior and exterior regions. Equation 
(10.8) in particular imposes that 

2M (r) 


e ^st — 1 — 2 =e Amt ~ 1 2 for r=4Q4; 
r ta 


17 The integration was first performed 
by Schwarzschild in 1916. For details, 
and a discussion of the case of a 
non-constant density, see Wald (1984), 
p. 129. 


18If these quantities were not continu- 
ous, their differentiation would intro- 
duce a measure with support r = rstar 
in the components of the Ricci tensor 
that are not present in the source. 
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that is, 


m= M(a)= F r°u(r)dr. (V.10.17) 
0 
The number M(a), sometimes called the mass function, can be com- 
puted in terms of the density in classical units by reinserting in the above 
formula the Einstein gravitational constant Gg = 87Gy. This leads to 
the integral of the density on the volume of a sphere B, of radius a, 
namely 


M° (a) = ney | r? U8 (r) dr. 
0 


Note, however that M (a) is not the proper mass of a static spherical star 
of standard radius a, which should be computed with the space proper 
volume element 


r2e2>dr sin 6 dé dọ, 


resulting in a smaller quantity; the difference represents the gravitational 
binding energy of the star in equilibrium. 


V.10.3 Non-static interior solutions 


Birkhoff’s theorem does not apply to interior solutions. There exist time- 
dependent, spherically symmetric solutions of the Einstein equations 
with sources in a domain r < fstar; the radius fstar of the star may be a 
function of t. The full solution is the considered interior solution gint for 
r < Tstar and a Schwarzschild exterior solution with mass m for r > Tstar, 
provided that rstar > 2m, the Schwarzschild radius of the star, which is 
constant for an exterior solution and is linked with the energy content 
of the star, which is considered as an isolated object. When the star 
contracts so much that festar becomes smaller than 2m, it is no longer 
observable from the region r > 2m of the spacetime (see Chapter VI). 

It is physically clear that there can be no static interior solution with 
zero pressure, because there is nothing in that case to resist the gravita- 
tional attraction. This can be checked mathematically by solving (10.12) 
with p = p' = 0, which gives 


u(r) = 2r’ M (r) = 2r-3 a Pulp) dp, (V.10.18) 


that is, setting y(r) = fo p?u(p) dp, the differential equation 


/ 


2 2 
y =-y, hence y=Ce’ , 
F 
with C a constant. Then 
=2 r° 


u= Cre”, (V.10.19) 


which is always infinite for r = 0. 
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V.11 Spherically symmetric gravitational 
collapse 


It is believed that when a star has exhausted all thermonuclear sources 
of energy, it will collapse under its own gravitational field. Oppenheimer 
and Snyder! made a rigorous study of this process for a spherically 
symmetric dust cloud by using a special solution given by Tolman?’ in 
the restricted case of a spatially constant matter density starting from 
rest. This study was revisited by Gu?! with the restriction to spatially 
constant matter lifted. Gu analysed shell-crossing singularities as well 
as the essential singularity at the centre of symmetry. He gave a de- 
tailed mathematical discussion of the formation of horizons and of what 
is now called the nakedness of the singularities. Hu?? constructed the 
fully general spherically symmetric solutions for dust, not restricted to 
the Tolman class, and discussed their properties. The work done by Gu 
and Hu for non-homogeneous dust clouds was repeated, in ignorance 
of these papers published in China, by Müller zum Hagen, Seifert, and 
Yodzis?*, who proved the existence of shell-crossing singularities in the 
case of the Tolman class; later, Christodoulou proved the existence of 
central singularities,?* starting from rest. Finally, Newman?°> obtained 
the general solution. These western papers were motivated by the cos- 
mic censorship conjecture, which is violated both at shell-crossing 
singularities and at the centre, as was already shown in Gu’s paper. 


V.11.1 Tolman, Gu, Hu, and Claudel-Newman metrics 


In the case of dust, the flow lines are the timelike geodesic trajec- 
tories of the particles; we take these geodesics as timelines (comoving 
coordinates). A spherically symmetric metric can then be written as 


— dt? +e” dr? + R?(d6? + sin? 0 d¢”), (V.11.1) 


where w and R are functions of r and t. The flow lines of the dust are 

the curves on which the coordinates r, 0, ġ are constant; a so-called dust 

shell is labelled by its coordinate r. The metric is regular as long as e?” 

and R are smooth positive functions. R? is (up to multiplication by 47) 

the area of the spherical dust shell of parameter r, at time parameter t. 

One fixes the parameter r by choosing it to be such that r = R(r, 0). 
The stress-energy tensor T of the dust reduces to 


Tag = Ug, (V.11.2) 


with u“ = 65, since the timelines are the matter flow lines. The only 
non-zero component of the stress energy tensor is 


Too =H, (V.11.3) 
with 
u= (r,t) for r<a and p(r,t)=0 forr>a, (V.11.4) 
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19QOppenheimer and Snyder (1939). 
20Tolman (1934) and, Bondi (1947). 


21Gu (1973). 


22Hu (1974). 


23Müller zum Hagen, 
Yodzis (1973). 


24Christodoulou (1984). 
25Newman (1986). 


Seifert, 


and 
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26Which may not remain the outermost 
shell if there is shell crossing as time 
evolves. 


27Note as before that a factor 47 has 
been included in our definition of dens- 
ities; the integral giving M(r) is taken 
over a 3-sphere. 


28Hu (1974) and Newman (1986). 


where a is the r parameter of the outermost dust shell?° at time t = 0. 

We denote by primes and dots respectively differentiation with respect 
to r and t. We compute the general solution in the chosen notation. The 
equation Rio = 0 is equivalent to the second-order equation 


R'—wR' =0, (V.11.5) 
which integrates to the first-order equation 
R'e™” = f(r), (V.11.6) 


with f(r) an arbitrary function of r. 
The conservation equations reduce to 


Vol™ = a +r% = ù+ ulo + 2R +R) = 0; (V.11.7) 


hence, with ¢ an arbitrary function of r, we obtain 


e 


p(t,r) = Fe C) (V.11.8) 
Using (12.6), this equation becomes 
r? po 
p(t,r) = eR (V.11.9) 
Here uo is an arbitrary function of r that we identify with the initial 
density because we have chosen r such that R(0,r) = r and hence 


R'(0,r) = 1. It has been shown by R. Newman that the Einstein 
equations then admit the first integral 


1 -2 M(r) _ 1 2 
sR- SE = SP (r) - 1) (V.11.10) 
with 
M(r) = fi " F(p)ult, p) R(t, p)e dp. (V.11.11) 


The function M (r) is the integral over the volume occupied by the dust 
shells 0 < p < r of the density weighted by the factor f(p).?” It is 
independent of t, as can be verified by using (12.6) and (12.9), which 
show that 


M(r) = [ voto? dp. (V.11.12) 


The explicit formulas for R(r,t) and w(r,t) depend on the sign of 
f(r) — 1; they have been computed and studied in the general case by 
Hu and Newman, to whom we refer the reader.?® The case where the 
dust cloud starts from rest, i.e. 


. M 
R(0,r) = Taa + f(r) -1=0, (V.11.13) 
has been studied by Christodoulou, using parametric equations for R 


and t. 


V.11  Spherically symmetric gravitational collapse 


Here, following Gu?? we study the case f?(r) = 1, in which exact 
integration permits a clear and comparatively short discussion. In the 
case f?(r) = 1, (11.10) reads 


R?R=+4\/2M(r). (V.11.14) 


We suppose that the star starts contracting; then the minus sign must 
be chosen when t increases, and the equation integrates to 


R(r,t)? = (r) — 5 VIM (rt, 


with ® an arbitrary function. Since we have normalized the radial par- 
ameter r by the condition R(r,0) = r, it holds that ®(r) = r2. We use 
the notation 


; 2M(r) =: h? (r), (V.11.15) 
and then 
R(r,t) = [r? — h2(r)t]3. (V.11.16) 
Using (11.6), we then find 
e” = R! = |r? — h? (r)t] ~$ [r? z sho tnl(n)t 
Here, as a consequence of the definition (11.15), we have 
hi (r) = Žr? u(r). 
On the other hand, the equation for u gives 
u(t,r) = ii (V.11.17) 


fr? RAC) elle? — GAENE 


We make a change of coordinates: instead of choosing as radial parameter 
the number characterizing a dust shell, we take the area R of a dust shell 
at time t. Since we have chosen f(r) = 1, i.e. R’ = e”, it holds that 


dR = e” dr + Rdt = e” dr — R7? \/2M(r) dt. (V.11.18) 
The change of coordinates from r,t to R,t is admissible if R’ does 
not vanish. In the new coordinates, the spacetime metric becomes, for 
O0<r<a, 


2M 
= h = ve) dt? +2R~2\/2M(r) dR dt+dR?+R?(d6?+sin? 6 d¢?). 


(V.11.19) 


We see that the metric of the space sections t = constant reduces to the 
Euclidean metric in polar coordinates. If the space manifold is, for every 
t, homeomorphic to a ball of three-dimensional Euclidean space and R is 
an admissible polar coordinate, then R(t,r) is also the distance at time 
t between the centre and the dust shell with parameter r. 


29Gu (1973). 
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30Shell crossing, leading to non-central 
singularities, exists when the density 
is not monotonically decreasing. See 
Gu (1973), Hu (1974), Müller zum Ha- 
gen et al. (1984), and Newman (1986). 


31Vanishing of R’ signals a shell 
crossing. 


V.11.2 Monotonically decreasing density 
Collapse of dust shells 


In this subsection, we suppose that uo is a monotonically decreasing 
function of the parameter r, as r increases from 0, the centre of the star, 
to a, the value of r at the surface boundary of the star at time t = 0. 

To study the possible collapse, we study the evolution in proper time 
t of the function R(t, r) for a given dust shell, i.e. for a given value of r. 
We see from (11.16) that the shell collapses at the centre of symmetry 
at the time tı (r) where R(t1,r) = 0; that is, 


ti(r) =h-2(r)r2. 
It follows that 


r2h-? [Bh(r) — rh!(r)] = orth} [3M (r) —rM'(r)]. 
(V.11.20) 


If uo is monotonically decreasing, then, using the definition (11.11) of 
M(r), we find that 


3M(r) —rM'(r) = 3 | Lo(p)p” dp — po(r)r? > 0. (V.11.21) 
0 
Hence the shells with increasing parameter r arrive successively at the 
centre, and there is no shell crossings.°° If the density is uniform (the 
Oppenheimer—Snyder case), then the dust shells all arrive at the same 
time at the centre. 
The metric (11.19) is a regular Lorentzian metric if the linear form 
dR — Rdt does not vanish, i.e. if R’ = e” > 0. We deduce from the 
expression (11.16) for R that 


1 
RIR = $, = sha (rh (ryt =0. (V.11.22) 


Hence R’(t,r) vanishes’! at a time tz given by 


h2(r)r2 2h2 /2M 
ipa ave = iD) zA n (V.11.23) 
h'(r) 3r? uo(r)  ržpo(r) 
We note that 
ty h'(r)r 
— = i .11.24 


The monotonicity of uo and the mean value theorem led Gu to the 
conclusion that 


that is, the dust evolution does not induce a singularity in the metric 
before a dust shell arrives at the centre, in agreement with the previous 


V.11 Spherically symmetric gravitational collapse 


conclusion of the absence of shell crossing. The first occurence of the 
singularity is at a time to, with 


r3 2 1 
to = s a] eR 


the greater j19(0) is, the sooner the singularity appears. 
Matching with an exterior metric 


The interior metric is given by (11.19) for r < a, ice. 


2 


3 


0 < R< fa? — h? (a)t] (V.11.27) 


The computations that we have made are still valid outside the star, 
where r > a and uo(r) = 0, but for r > a it holds that 


M(r) = Ma =| uol)? dp, r>a, (V.11.28) 
0 


and the exterior metric reads 


(1 x) dt? +2 = dR dt + dR? + R?(d0? + sin? 6 dẹ’). 
(V.11.29) 


It is a boosted Schwarzschild metric with horizon 
R=2M,. (V.11.30) 


This solution takes the usual Schwarzschild form 


2M. 2M, 
—{1-——*) d? + |1- 1 
Faea 


=i 
) dR? + R?(d6? + sin? 6 do?) 


(V.11.31) 
if we make the change of time coordinate 
2M,\* [2Ma 
dt = dt 1 dR. V.11.32 
r=u- (1-3) y4 (V.11.32) 


The full spacetime metric g is defined on R? x Rt, with R,0,¢ polar 
coordinates on R° and t € Rt (i.e. t > 0) by 


2 
g= gas OSRE fa? — h? (aje *, (V.11.33) 


2 
9=Gexr, R2 |a? - hè (ael (V.11.34) 


The interior dust solution is hidden behind the horizon when 
[a? as h? (aJt! 5 < 2Ma. (V.11.35) 
This begins at a positive time t3 given by 


ts = h73(a)[a? — (2M,) 


3 
2 


] (V.11.36) 
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if a > 2M,, that is, if the star is initially visible. We have seen that 
the outer shell of the star collapses into a singularity when tı(a) = 
h-2(a)a2, hence after the star has ceased to be visible. This fact was an 
inspiration for the formulation of the cosmic censorship conjecture 
by Penrose (see Chapter VIII). 


V.12 Problems 


V.12.1 Relativistic and Newtonian gravitational 
masses 


Show that the relativistic gravitational proper mass of a spherical star 
with radius a and density u(r) is larger that its Newtonian gravitational 
mass. 


Solution 


The Newtonian gravitational mass is, with Gy the Newtonian gravita- 
tional constant, 


M(a) = Gn f J P(r) drsin0 dr d9 do = 4nGy | r? u(r) dr. 
2J0 0 


The relativistic gravitational mass, computed with the proper volume 
element, is 


dr. (V.12.1) 


r 


MaS GR f ao) h 2 = 2 


The proper mass M, is greater than M (a) because [1 — 2M(r)/r]~? > 1. 
The difference represents the gravitational binding energy of the star in 
equilibrium. 


V.12.2 The Reissner—Nordstr6m solution 


Show that the Reissner—Nordstré6m metric (found in 1916 by 
Reissner) and given in standard coordinates by 


2 2 2 a 
(1 ue 4 4 ) dt? + (1 pall ae ©) dr? +r? (dé? + sin? 6 dy”) 
r r T T 
(V.12.2) 


is a spherically symmetric solution of the Einstein-Maxwell equations 
with electromagnetic potential 


A;=0, A= =S, (V.12.3) 


identical with the classical electrostatic potential of a spherical body 
with charge Q. 

Show that the metric (12.1) is smooth and Lorentzian with t a time 
variable, provided that 


pie (V.12.4) 


Remark V.12.1 For large r, the term Q/r? is small in comparison 
with m/r. Since the total charge of celestial bodies is observed to be neg- 
ligible in comparison with their mass, the Reissner—Nordstrom solution 
has little application in astrophysics. It was also abandoned as a possible 
model for the electron, for which m~'Q? = 2.8 x 10713 cm, and so the 
influence of the term r~?Q? would be important only at distances where 
quantum effects cannot be neglected. 

The Reissner—Nordstrém solution has attracted interest in connec- 
tion with modern aspects of mathematical General Relativity. Indeed, if 
m? > Q?, it possesses two event horizons (see Chapter VI) given in 
standard coordinates by 


cmim. (V.12.5) 


It would invalidate a censorship conjecture if the word ‘generic’ was not 
included in the hypotheses (see Chapter VIII). 

Reissner-Nordström-type solutions have recently become important in 
string theories. 


V.12.3 Schwarzschild spacetime in dimension n + 1 


Construct by reasoning and computations analogous to those for 
n+1=4 the (n + 1)-dimensional Schwarzschild metric in spherical 
standard coordinates, r € Rt, 


2m 2m \ T! 
JSchw = (1 z) dt? 4 (1 =) dr? +r? dus”, (V.12.6) 


rr 


where dw? is the metric of the sphere $"~!. The Schwarzschild spacetime 
is defined as for n = 3 by this metric supported by the manifold M x R, 
with M the exterior of the ball r”~? = 2m. 


V.12.4 Schwarzschild metric in isotropic 
coordinates, n = 3 


Show that in isotropic coordinates the four-dimensional Schwarzschild 
metric reads, with R? = X? + Y? + Z?, 


2R—m m\4 
ir- dt? + [14 X24+dY?+dZ?). (V.12. 
oan (are) ( IR) (A EE E Steet) 
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Solution 


Recall that, in standard coordinates, 


2 2m\~! 
9Schw = (1 z) dt? 4 (: z) dr? + r?(d6? + sin? 0 d¢?). 
r r 


(V.12.8) 
The relation between r and R is 
m\2 4R? +4Rm+ m? 
-R14 ) = 12. 
re ( 2R 4R ee) 
and hence the coefficient of dt? becomes 
12m _,___ 8m _ (2R-m)\" 
ro (2R+m)2 \2R+m 
To compute the space metric in isotropic coordinates, we first remark 
that 
i 2m\ T! _ r _ (2R+m a 
r = r=2m \2R—m 
and 


(V.12.10) 


We then proceed as follows. From the definitions 


RUX=r ‘2, RIY = rly, Z=Rr-1z, and 
Re +Y? +2’) z= F(a" +y? +2), 
(V.12.11) 
it results that, 
d(r™tz) = r dz — r`°zdr = R~! dX —R?XdR 
with analogous relations for y and z, and hence 
r°? (dz)? — 2r-32 dz dr + r—4a?(dr)? = R? (dX)? 
— 2R’ X dX dR+R-*X7(dR)’, 
and, by summation and simplification, 
r *(da? + dy? + d2? — dr?) = R? (dX? + dY? + dZ? — dR’). 
Using 
r?(d0? + sin? 0 do?) = dz? + dy? + dz? — dr? 
= r*R-*(dX? + dY? + dZ? — dR’) 


V.12 


shows that the space part of gschw can be written as 


(Fe 


2 2 2 4 
m 2 m 2 2 2 Jp2 
atm) (1 7) dR + (1455) (dX? + dY? + dZ? — dR?), 


and, after simplification, the Schwarzschild metric is obtained in iso- 
tropic coordinates. 


V.12.5 Wave coordinates for the Schwarzschild metric 
in dimension n + 1 
Show that the requirement that x” = (t,x) be wave coordinates, 
gx" =0, with zê = 7(r)n’, n’ € S”, reduces to the equation 
d |\dr 
dr | dr 


A*F—(n—-1)F = 


ime Ol 2") —(n—-1)r = 0, (V.12.12) 


with A* the Laplacian on the sphere $”~!. 
Show that setting s = 1/r gives as equation singular at s = 0: 


= sa — oms") = (n= 1)s "F. 


ds 
Show the asymptotic expansions 
Foira m i mr- Ine +O(r5lnr), n= 4, 
' (n—2)rr-3 | O(r5=?”), n>5J` 
Solution 


See Choquet-Bruhat, Chruściel, and Loiselet (2006). 
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Singularity 


Fig. VI.1 Schwarzschild black hole. 


Black holes 


VI.1 = Introduction 


Laplace and Michell had already foreseen that light can become trapped 
by a massive body, so that the latter becomes black to observers and is 
perceptible only through its gravitational field. However, the apparent 
‘Schwarzschild singularity’ r = 2m is a phenomenon that has no ana- 
logue in classical mechanics, and the completed spacetimes are indeed 
very strange. It was long believed that these extensions have no physical 
reality, i.e. that matter cannot be compressed so that it is included in 
the region r < 2m. It was mainly through the vision of Robert Oppen- 
heimer and John Archibald Wheeler that the reality of black holes was 
seriously considered. They cannot be seen directly, but astronomical ob- 
servations, in particular perturbations of motions of various stars, reveal 
gravitational fields too strong to be explained by an invisible massive 
companion other than a black hole. Some X-ray sources and active gal- 
actic nuclei are interpreted in terms of black holes. Also, some y-ray 
bursts are thought to be due ultimately to the explosion of matter ac- 
creted near the boundary of a black hole by the gravitational field it 
generates. There is now a large consensus among astrophysicists on the 
existence of many black holes in the universe, even at the centre of our 
own galaxy. 


VI.2 The Schwarzschild black hole 


The first model of a black hole appears in the first exact solution of the 
Einstein equations constructed by Schwarzschild in 1916, the spherically 
symmetric one that we have studied in Chapter V (Fig. VI.1). We have 
defined the Schwarzschild spacetime as the manifold r > 2m in R? x R, 
with r a polar coordinate in RÌ, with Lorentzian metric given in standard 
coordinates by 


dt? + (= — 1) dr? + r?(d6? + sin? 6 dd”),  (VI.2.1) 


and we have called the submanifold r = 2m of R? x R, diffeomorphic to 
the product $? x R, the event horizon. 

The Schwarzschild metric in standard coordinates with m > 0 ceases 
to be a smooth Lorentzian metric for r = 2m; at this value of r, the 
coefficient goo vanishes and g1; becomes infinite. 
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For 0 < r < 2m, the Schwarzschild metric in standard coordinates is 
again a smooth Lorentzian metric, but t is a space coordinate while r is 
a time coordinate. 


Exercise VI.2.1 Show that the Schwarzschild metric with 0 < r < 2m 
is neither spherically symmetric nor static. 


The volume element of gschw in standard coordinates, 
r? sin dr d@dydt, is smooth! and non-vanishing. More importantly, 
a straightforward computation shows that the coordinate-independent 
scalar ROY? Raupys: called the Kretschman scalar, is finite at r = 2m, 
in fact for all r > 0, being equal to 48m?/r°. This property led to 
the belief that the Schwarzschild spacetime (S$? x {r > 2m}) x R is 
extendible, in the sense that it can be immersed in a larger Einsteinian 
spacetime, whose manifold is not covered by the Schwarzschild standard 
coordinates with r > 2m. 


VI.3 Eddington—Finkelstein extensions 


Let us consider (following Eddington in 1924, Lemaitre in 1926, and 
Finkelstein in 1958) the change of coordinates, defined for r > 2m, 
obtained by replacing the canonical Schwarzschild time t by the ‘retarded 
time’ v given by,” 


v=ttr+2mlog (= -1). (VI1.3.1) 


Using this new coordinate v together with r, 6, p, the Schwarzschild met- 
ric can be expressed as the so-called Eddington—Finkelstein (EF) 
metric 


T 


2 
(1 z) dv? + 2dr dv + r? (sin? 6 dy? + d6?). (V1.3.2) 


The EF metric is a smooth metric of Lorentzian signature? on the 
manifold S* x R* x R, (6,y) € S*, r € Rt, v € R defining a vac- 
uum Einsteinian spacetime called the Eddington—Finkelstein black hole. 
By its construction, the Schwarzschild spacetime is isometric with the 
domain r > 2m of the EF black hole. 


Exercise VI.3.1 Prove these statements. 


The submanifold r = 2m is null (isotropic), since g’” = 0 for r = 2m. 
One family of radial (ie. 0 = constant, p = constant) light rays is 
represented by straight lines v = constant; the other family is given by 


(1 n) dv + 2dr = 0; 
r 


that is, 


lif r,0,ġ are interpreted as polar co- 
ordinates on R3, then 6,¢ are only 
locally coordinates in the usual sense; 
the sphere S$? is not diffeomorphic to 
R? and hence cannot be covered by a 
single coordinate patch. 


2This change of coordinates from 
t,r,0,@ to v,r,@,@ is singular for 
r=2m. 


3The vanishing of the coefficient gvv 
at r = 2m does not correspond to a 
singularity of the metric, because its 
determinant, det g = 1, does not vanish 
there. 
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Fig. VI.2 (a) Orientation of the light 
cones in the standard Schwarzschild co- 
ordinates. (b) Null geodesics and light 
cones in the extension. Here t = v —r, 
where v is the ‘retarded’ time defined 
in (3.1). 


which integrates for r < 2m to 
v = 2r + dmlog(2m — r) + constant. (V1.3.3) 


Under the change of coordinates 


2m 
t=v—r—2mlog | — -1}, 
r 


the EF metric takes, in the domain r < 2m, the Schwarzschild form 
9 =i 
(=- i) dt? — (= -1) dr? + r?(d6? + sin? 6 dé”), 


but t is now spacelike, and r timelike. The metric is no longer static. 
It is singular for r = 0—in general interpreted as a limiting spacelike 
3-surface. The following theorem justifies the name of black hole given to 
the EF spacetime: no future light ray issuing from a point where r < 2m 
crosses the event horizon r = 2m, as can be seen from the light cones 
and null geodesics in Fig. VI.2. More generally, the theorem is as follows: 


Theorem VI.3.1 Ona timelike line issuing from a point with r < 2m, 
in standard coordinates, the variable r is always decreasing and tends to 
zero in a finite proper time. Hence any observer crossing the Schwarz- 
schild radius r = 2m attains the singularity r = 0 in a finite proper 
time. 


Proof. This is conveniently done in terms of the t,r coordinates, not- 
withstanding the fact that t is now a space coordinate and r a time 
one. 


(a) At 


r=2m 


r=2m 


`y 
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(1) Radial geodesics: we have found in Chapter V that on such geodesics, 
with s the proper time,* 


. dr 
ri= 
ds 


Hence the point r = 0 is attained from r = rp < 2m after the finite 
proper time 


f= BE? —(1-—2mr—") (VI.3.4) 


? 


roq 3d 
j dr = eas l (VI1.3.5) 
o dr 2m — (1 — E?)r 


(2) Non-radial geodesics and non-geodesic® motions lead to a smaller 
lapse of proper time between points with coordinates ro < 2m and 0. 


The amount of proper time taken for an astronaut entering the black 
hole to reach the singularity is indeed very short—it is of the order of 
the Schwarzschild radius estimated in geometric units. For a black hole 
with mass of the order of the solar mass, this gives a time of the order 
of 1075s. 


Remark VI.3.1 Because of the singularity for r = 0, the Finkelstein 
black hole cannot be considered as a generalized solution of the Einstein 
equations, in contrast to the 1/r singularity in the potential of Newtonian 
gravitation, which is solution of the Poisson equation with right-hand 
side a delta measure mass located at the origin. The idea now is rather 
to consider that, near this singularity, the gravitational field is so strong 
that it becomes a quantum field or even a string field, for which the 
Einstein equations are no longer applicable. 


VI.3.1  Eddington—Finkelstein white hole 


By time reversal, one obtains manifestly another extension of the 
Schwarzschild spacetime. The manifold is again S? x (r > 0) x R and 
the metric is 


r 


2 
(1 z) dv? — 2dr dv + r° (sin? 0 dy? + d0°). 


The extension to r < 2m now appears to observers in the Schwarzschild 
spacetime as a white hole: nothing can penetrate into it, but every past 
inextendible light ray or timeline in the Schwarzschild spacetime eman- 
ates from this white hole, as can be seen from the radial null geodesics 
and light cones in Fig. VI.3. 


VI.3.2 Kruskal spacetime 


It is possible to embed the Schwarzschild spacetime and both of its 
extensions in a larger spacetime containing an additional copy of the 


4We are in the region r < 2m, and 
hence 


E? — (1—2mr7') 
=r} [2m — r(1— E?)] >0. 


5Recall that in a Lorentzian manifold 
causal geodesics realize a local max- 
imum of length between two causally 
related points. 


6Remember also that for r < 2m, 
the coordinate r is timelike. In a two- 
dimensional, necessarily somewhat mis- 
leading, conformal diagram, r = 0 is 
represented by a spacelike line. 
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Fig. V1.3 Radial null geodesics 
and light cones of the Eddington— 
Finkelstein white hole. Here t = v —r, 
where v is the ‘retarded’ time defined 
in (3.1). 


TTachyon is the name given to a (so-far 
unobserved) particle travelling faster 
than light. 


aY 


r=2m 


Schwarzschild spacetime for which the previous black hole extension now 
plays the role of a white hole and vice versa. The support of the obtained 
spacetime is the manifold S$? x R?. 

The metric of the Kruskal spacetime reads, in Kruskal coordinates, 
32m? 
cate 7/2m(qz2 — dw?) +r? (d0? + sin? 0 d¢2), 

r 
where 0, ¢ are coordinates on S? while z, w are coordinates on the open 
set diffeomorphic to R? defined by 


ur S41; 


and r is the function of z and w defined by 


1 
z2 =w? = mC —2m)e"/?™, 
In Kruskal coordinates, the radial light rays are represented by straight 


lines. 


Exercise VI.3.2 Obtain the portion of the Kruskal spacetime isomet- 
ric to the Schwarzschild spacetime by the change of coordinates 


eS ev/am af gure, w= ev/4m _ gale. (VI.3.6) 
with 
r 
=t—r—2mlog (=~ -1). 13. 
u = t-r — 2mlog a (VI.3.7) 


Obtain other portions by analogous changes of coordinates. 


The Kruskal spacetime has two asymptotically flat regions (I and I’ in 
Fig. VI.4(a)), each of which is isometric to the Schwarzschild spacetime. 
A section through the Kruskal spacetime connecting these two regions, 
for instance w = 0, is called the Schwarzschild throat (Fig. VI.4(b)). 
No non-tachyonic’ signal can travel from one of the asymptotically flat 
regions to the other. 

The Kruskal spacetime cannot be embedded in a larger spacetime. 


(a) w 


r= constant < 2m 


r= 2m 


(Singularity) J x 
r=0 \ 


r= constant >2m 


r = constant >2m 


ta0 == 


r = constant 


r=0 N 


0 (Singularity) 


r=2m 


VI.4 Stationary black holes 


Most stars are not static? with respect to their local inertial reference 
frame, nor are they exactly spherically symmetric, so they cannot be 
modelled by a Schwarzschild spacetime or, after their collapse, by a 
Schwarzschild black hole. The simplest physical situations after static 
are the stationary ones (see Chapter IV), where the spacetime is of the 
type (M x R,g) with g invariant under translations along R. 


VI.4.1 Axisymmetric and stationary spacetimes 


In General Relativity, physical considerations? led to the conjecture!” 
that a stationary black hole is necessarily axisymmetric, a spacetime 
(V,g) being said to be axisymmetric if the underlying manifold is of 
the type M x R, with M, = M x {t} spacelike and {x} x R timelike, and 
the spacetime metric is such that M; is diffeomorphic to the exterior of 
a subset of R, g admits a group S1 of isometries that acts on R? like 
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Fig. VI.4 (a) Kruskal 
(b) Schwarzschild throat. 


spacetime. 


8In particular, their motion is not in- 
variant under time reversal. 


9A non-axisymmetric rotating body 
would lose energy by gravitational ra- 
diation and hence cannot be stationary. 


10The conjecture was formulated by 
Hawking in the 1970s and proved by 
him under an analyticity hypothesis 
and some restrictive geometric assump- 
tions (see Hawking and Ellis, 1973). 
The analyticity hypothesis and some 
of the geometric assumptions have 
recently been removed by Alexakis, 
Ionescu, and Klainerman (2013). 
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a rotation group with given origin O,, and the lapse and shift of g are 
also invariant under this group. 

A spacetime is called stationary axisymmetric if, the origins O; 
being derived from each other by time translations, the two Killing fields 
0/0¢ and ð/ðt commute. Moreover, for such a spacetime, it is required 
that the metric be invariant under simultaneous reversals of time t + —t 
and angle of rotation z! = ¢ |> —4@, as is consistent with physical 
interpretation. It holds then that 


902 = 903 = 912 = 913 = 9, 


since the signs of these coefficients change under such reversals. Using the 
fact that a two-dimensional metric can always be put in diagonal form, 
we write the metric of an arbitrary stationary axisymmetric spacetime 
as follows: 


g = goo dt? + 2go: dt do + g11 dd? + g22(dx?)? + g33(dx*)?. 


VI.5 The Kerr spacetime and black hole 


In 1963, Roy Kerr found a stationary axisymmetric metric, an exact 
solution of the Einstein vacuum equations, which was later interpreted 
as a black hole and became very important in General Relativity (see 
the following sections). 


VI.5.1 Boyer—Lindquist coordinates 


The Kerr metric, written in Boyer—Lindquist coordinates (found by 
Boyer and Lindquist in 1967), reads 


2 
(1 z) daima aes fa | Ade? 


A A 
2mra? sin? 6 
4 G geo -= ) sin? 6 d¢?, (VL.5.1) 
with a and m constants and 
A=r’+a’*cos?0, B=r?—2mr +a. (VI.5.2) 


The coordinate r appearing in this metric can be considered as defining 
the manifold structure of the support of this metric by interpreting it 
together with 0 and ¢ as polar coordinates on the outside of the subset 
of R? such that A > 2mr and B > 0; t € R is a time coordinate. 

The Kerr metric (written here in Boyer-Lindquist coordinates) is axi- 
symmetric (it does not depend on ¢) and stationary but not static (it 
does not depend on t, but it is not invariant under time reversal). It 
is invariant under simultaneous reversals of time t + —t and angle of 
rotation ¢ ++ —@, as required. The Kerr metric is interpreted as the 


VI.5 The Kerr spacetime and black hole 


gravitational field outside a rotating black hole. It has not been found 
to be the gravitational field outside any realistic rotating star. 

To interpret the constants a and m, we go to Cartesian-type coordin- 
ates 


r? + y? 
z i 


2 


r?=g° +y +z, 0 = Arctan (VI.5.3) 


We then see that for large r, the Kerr metric tends to the Minkowski 
metric and we have the following equivalence for its deviation from 
Minkowski: 


IKerr = (1 x) dt? + (1 x) dr? + r°(d0? + sin? 0 dd”) 
T r 
(VI.5.4) 


4ma 


— P dy — y dx) dt. (VI.5.5) 


The first line is equivalent for large r to the Schwarzschild metric: we 
identify m with the mass of the system. The last term leads to the 
identification of a as the angular momentum per unit mass of the system. 
The Kerr metric is invariant under the change (a,t) — (—a,—t), in 
agreement with this interpretation of a. 


VI.5.2 The Kerr—Schild spacetime 


The Kerr metric in Boyer—Lindquist coordinates becomes singular on 
various surfaces. As with the Schwarzschild metric, it is possible to trans- 
form it to another metric admissible in a larger domain. The following 
coordinates adapted to light propagation and redefinition of angular 
variable adapted to the rotation were introduced by Kerr and Schild in 
1965: 


r? +a? 


dv = dt + dr 
d® = do + au. 


The Boyer—Lindquist metric then takes the Kerr—Schild form 


yy 
gks = (1 | dv? + 2drdv+ Ade? + oF sin? 0 d? 


è 2 0 
damr dv d® + 2a sin? 6 dr de, 


with 

X? = (r? + a”)? — Ba? sin? 0. 
This metric extends to a regular Lorentzian metric for all A > 0. It 
reduces to the Minkowski metric in retarded time and polar coordinates 


of R? if m = a = 0, and it reduces to the Eddington—Finkelstein metric 
when a = 0. 
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VI.5.3 Essential singularity 


The Kerr metric becomes singular for A = r? + a? cos? 0 = 0. This is a 
genuine singularity: the Kretschman scalar tends to infinity when A 
tends to zero. 

In order to have the Kerr metric reducing to a form of the Minkowski 
metric when m = 0 and a ¥ 0, the coordinates are reinterpreted as 
follows: 0 and ¢ are coordinates on 8°, but r = 0 is not a single point 
of R°; it is assumed instead that the manifold R? defined by the Carte- 
sian coordinates x,y,z is represented in the variables r, 6, through the 
mapping (oblate polar ‘coordinates’ ) 


T= (r? +a?)? sin ®sin 0, y= (r? + a2)? cos @sin 8, 
z=rcosé. 


The singularity A = 0 is then interpreted as the circle r = 0,0 = 7/2, 
i.e. 


VI.5.4 Horizons 
The case |a| >m 


This would apply to very rapidly rotating bodies. We then have B > 0 
for all r, no surface r = constant is a null surface, and the essen- 
tial singularity is naked. This case is presently considered as physically 
unrealistic. 


The case |a| < m 


The Boyer—Lindquist metric then appears as singular for r = ry or 
r =r_, solutions of B = 0 given by 


re =mt Vm? —-a?. 


When a tends to zero, the Kerr metric tends to the Schwarzschild met- 
ric, r} tends to 2m, and r_ tends to zero. The surfaces r = r4 are 
not singular in the Kerr—Schild spacetime, but they are null surfaces— 
the contravariant component g’” of the Kerr—Schild metric vanishes for 
B= 0, as can be foreseen and checked by direct calculus. 

The surface r = r} is the event horizon: no particle entering the region 
r <r can escape from it: the future light cone at points where r = r} 
points entirely towards the interior. The surface r = r_ has no obvious 
physical meaning. 


VI.5.5 Limit of stationarity. The ergosphere 


The variable r becomes a time variable when 


2mr 


A 


>1. 
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The metric is then no longer stationary. The surface r = rgtaz, with rstat 
the largest root of A — 2mr = 0, is called the limit of stationarity. It 
does not coincide with the horizon when a Æ 0: it holds that 


stat = M + V M? — a? cos? 0 > ry. 


The domain between the limit of stationarity and the horizon r4 is called 
the ergosphere. 

The Penrose process is the extraction of energy from a Kerr black 
hole by dropping a particle into the ergosphere, which will emit a particle 
with a greater energy. The process can be described as follows. Consider 
the Killing vector X = ð/ðv, with components (X®“) = (1,0,0,0) in 
Kerr-Schild coordinates. This vector is timelike outside the ergosphere 
T > Tstat, 


2mr 


A 


GopXX? = — (1- ) <0 when r>Tstat; 

and spacelike when r} < r < rstat- A particle with timelike 4-momentum 
p has an energy E = —p*X, with respect to the vector field X; this 
scalar is constant along the trajectory (a geodesic) because of the law 
of dynamics and the fact that X is a Killing vector. It holds that E > 0 
outside the ergosphere. Suppose that the particle enters the ergosphere 
and splits there into two particles with momenta pı and pə. By the 
conservation of 4-momentum, it holds that 


p= pı +p2, E= E + Fo, 


with Fi = —p?X,. Since X is spacelike inside the ergosphere, it 
is possible to have the timelike vector pı such that E, < 0; hence 
E> = E — FE, > E. Both scalars E and Ea are conserved along the tra- 
jectories of the respective fragments. If the second fragment returns to 
the outside of the ergosphere, it will be with a greater energy (with re- 
spect to the X observer) than the whole piece which left it. It can be 
considered that it has extracted energy from the rotating black hole. 


VI.5.6 Extended Kerr spacetime 


The Kerr-Schild spacetime that we have defined is not complete. It is 
possible to extend it (even to negative r), but the results'! are much 
more complicated than in the Schwarzschild case and their physical 
interpretation very unclear. 


VI.5.7 Absence of realistic interior solutions or models 
of collapse 
Despite considerable effort, no realistic source has been found to make 


an exterior domain of a Kerr metric a complete admissible Einsteinian 
spacetime. 


11Cf. for instance, Wald (1984) and 
Chandrasekhar (1983). 
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12For details on the state of 
these hypotheses—in particular 
on the horizon—see Chapter XIV, 


Section 11 of YCB-OUP 2009 (con- 
tributed by P.Chruściel), Bray and 
Chruściel (2004), and Alexakis 
et al. (2013). 


13 This uniqueness was predicted in the 
early 1960s by J. A. Wheeler and de- 
scribed by the picturesque statement 
‘Black holes have no hair’. 


14See again Chapter XIV, Section 11 
by Chruściel in YCB-OUP 2009. 


No construction of gravitational collapse analogous to those due to 
Oppenheimer—Snyder, Gu-Hu, and Claudel-Newman in the spherically 
symmetric case has been made for axisymmetric gravitational collapse. 


VI.6 Uniqueness theorems for stationary 
black holes 


We have seen in Chapter IV that there are no gravitational solitons 
with support compact in space and sources with non-negative energy. 
There is no theorem on the non-existence of spacetimes with compact 
space outside the black hole region—but nor are there any examples of 
such spacetimes. Unless otherwise specified, a black hole is considered 
as an isolated object in spacetime; the black hole region in space is an 
asymptotically Euclidean manifold with boundary its event horizon. 


VI.6.1 


The uniqueness of a static black hole in vacuum, the Schwarzschild black 
hole, had been conjectured in the early 1950s by G. Darmois. It was 
proved with simple and precise assumptions by W. Israel. 


The Israel uniqueness theorem 


VI.6.2 Uniqueness of the Kerr black hole 


The following uniqueness theorem was proved in 1975 by Robinson, 
improving previous results of Carter from 1972. 


Theorem VI.6.1 Stationary, axisymmetric spacetimes that are so- 
lutions of the vacuum Einstein equations are uniquely specified by 
two parameters—the mass m and the angular momentum a—if they 
have a regular event horizon, are smooth outside the horizon, and are 
asymptotically flat. 


Since the Kerr metric provides a solution of the vacuum Einstein equa- 
tions satisfying the hypothesis of the theorem for any given m and a, the 
Kerr metric represents the only such axisymmetric stationary black hole. 

The Hawking—Klainerman theorem on the axisymmetric property of 
a stationary black hole completes, under its hypotheses,!? the ‘no- 
hair’ theorem!’ for a stationary black hole in a vacuum spacetime of 
dimension 4. 

In higher dimensions, other stationary black holes have been con- 
structed by Emparan and Reall.14 


VI.6.3 Stability of the Kerr black hole 


The intuitive definition of the stability of a dynamical system is that 
its evolution does not change much under a change in its state at an 
initial time. A stronger requirement if the system admits an asymptotic 


state for given initial data is that the system admits the same asymptotic 
state for data that are perturbations of these data. If the dynamics obeys 
nonlinear differential equations 


N(u) = f(D*u,...,Du,u) = 0, (VI.6.1) 


one is tempted to study the stability of a solution u, at least in a first 
step, to look for the evolution of a perturbation du that satisfies the 
system linearized at u: 
gog Diu ae: oy (wou =i (V1.6.2) 
If ĝu remains small for all times for small initial data, the solution u 
is called linearization-stable. If ĝu tends to zero when the time tends 
to infinity, u is called asymptotically linearization-stable. Linearization 
stability is often treated by physicists by considering perturbations given 
by Fourier series. The linearization stability of the Schwarzschild space- 
time was thus found by Zerilli and Wheeler in the early 1960s. Results 
on the linearized stability of the Kerr spacetime have been obtained.!° 
Nonlinear stability is a much more difficult problem, requiring global 
existence of solutions of nonlinear differential equations. The nonlinear 
stability of the Kerr spacetime is a subject of active research, in 
particular by S. Klainerman and collaborators, with conjectures and 
partial results. 


VI.7 General black holes 


An intuitive physical definition of a black hole is easy to give: a black 
hole is a region of spacetime in which the gravitational field is so strong 
that no signal, in particular light, can get out. A black hole may reveal its 
presence by the motion of surrounding stars that indicates the presence 
of an enormous mass confined in a region too small for any known matter 
to be contained in it. The presence of a black hole can also sometimes 
be deduced from the very short and bright emission!® of y rays (a y-ray 
burst) due to the explosion of an accretion ring formed around the black 
hole by its tremendous gravitational pull. Black holes are the subject 
of active observational research. The black holes interior is physically 
terra incognita, and very strange indeed. It is the subject of fascinating 
advanced and varied mathematical studies and conjectures. 


VI.7.1 Definitions 


A mathematical definition of an isolated black hole given by Penrose 
through a conformal diagram of an asymptotically Euclidean spacetime 
is often used. A definition that does not appeal to conformal compact- 
ification can be given for isolated black holes, or for a group of black 
holes and material bodies that is isolated in the sense that it is so far 
from other bodies that it may be considered to be embedded in an 
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15See, in particular, Dafermos, 
Holzegel, and Rodnianski (2013). 


16 Although it is rather thought now 
that this emission is in general quasi- 
stationary. 
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17See, for instance, Penrose (1979). 


18Generic is usually taken to mean 
without exceptional properties or sta- 
ble under small perturbations. 


asymptotically Euclidean spacetime (V, g). The following is a possible 
definition: 


Definition VI.7.1 The black hole region B of the spacetime (V, g) 
is the complement of the past of the set covered by the null geodesics that 
have an infinite future canonical parameter. 

The event horizon H is the boundary OB of the black hole region B 
in the spacetime V. Under appropriate reasonable assumptions, the event 
horizon H is a null hypersurface (C° but not necessarily Ct), generated 
by null geodesics. 


VI.7.2 Weak cosmic censorship conjecture 


The original idea of Penrose,!” coming from the study of spherical gravi- 


tational collapse where a black hole forms and hides the singularity from 
timelike observers, is the conjecture that generic Einsteinian spacetimes 
with physically reasonable sources do not admit any naked singular- 
ity, that is, a singularity visible by an observer. With the definition of 
singularity by incompleteness, trivial counterexamples can be obtained 
by cutting out regions of any spacetime, for example Minkowski space- 
time. Therefore, the mathematics must be more precise to grasp the 
physical content. We give the following definition of a nakedly singular 
spacetime. 


Definition VI.7.2 An inezxtendible spacetime (V, g) is said to be future 
nakedly singular if it admits a future inextendible causal curve that lies 
entirely in the past of some point x E€ V. 


We formulate the conjecture as follows: 


Definition (No naked singularity conjecture). An ineztendible, 
generic,'® Einsteinian spacetime with physically reasonable sources ad- 
mits no naked singularity. 


Note that the big bang is not a counterexample to this conjecture—it 
has no past, and hence does not correspond to any future inextendible 
causal curve. 

A generic spacetime can be understood as a spacetime that is stable, in 
some sense to be defined, under small perturbations. Reasonable sources 
are physical sources that have a hyperbolic, causal evolution and do not 
have their own singularities (shocks, shell crossings, etc.). 

We will briefly return to cosmic censorship in Chapter VIII. 


VI.7.3 Thermodynamics of black holes 


There are many theorems and conjectures about the thermodynamics 
and quantum properties of black holes—difficult to prove and even of- 
ten to state in a precise way. Several properties have analogies with 


the fundamental laws of thermodynamics. They have led physicists to 
attribute to general black holes both entropy and temperature. 

The first black hole ‘thermodynamic’ property is the Hawking area 
theorem, which says that the surface area A of the horizon H at a 
given time cannot decrease with time: it led to the identification of A, 
up to a constant factor, with the entropy of the black hole. 

The temperature of a black hole has been defined through its surface 
gravity,!® often denoted by « (not to be mistaken with the gravitational 
constant Gy, which is also often denoted by «). The usual definition of 
the surface gravity does not apply to the horizon of a black hole. It 
is defined for stationary black holes by the formula, with k the Killing 
vector tangent to the horizon, 


k°Vak? = Kk. 


This definition has been challenged by many authors and is still the sub- 
ject of active discussion and possible generalizations to non-stationary 
black holes. 

Hawking radiation from black holes is a quantum effect. It is well 
known that there are no absolute barriers in quantum theory. Hawking 
introduced quantum waves to study radiation from black holes and pro- 
vide a consistent definition of their entropy and its variation (i.e. the 
decrease of their horizon area) with ‘black hole evaporation’. Such ra- 
diation could be due to splitting of virtual pairs of particles emerging 
from the quantum vacuum outside the black hole, with one particle fall- 
ing into the black hole and the other escaping from it. Hawking radiation 
is very weak and has not been observed. 

These properties of black holes are fascinating and the subject of active 
study and discussion, involving also quantum gravity, a largely open sub- 
ject. They are at present beyond any possible physical confirmation and 
mostly also any rigorous mathematical proof, although progress in the 
construction of a quantum theory of gravitation, in particular through 
string theory, opens new roads to the modelling of black hole interiors. 

An exception, as far as mathematical proofs are concerned, is the proof 
through sophisticated mathematics by Huisken and Ilmanen and inde- 
pendently by Bray?° of the Riemannian Penrose inequality, which 
admits a purely geometric formulation. It is linked with several theorems 
or conjectures due to Penrose (see Problem VI.10.5). 


VI.8 Conclusions 


VI.8.1 Observations 


The conjectured existence of the first black hole, Cygnus X1 in 1972, 
was deduced from the study of a star’s orbital parameters, the values of 
which implied the existence of an invisible companion too massive to be 
thought to be even a neutron star. An enormous number of candidate 
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19For material bodies, the surface grav- 
ity in Newtonian theory is the gravi- 
tational acceleration at points of its 
surface, about 9.81 ms~? in the case of 
the Earth. 


20See, for instance, Bray and Chruściel 
(2004). 
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black holes have now been detected. Astronomers usually classify black 
holes as follows. 

Stellar-Mass black holes with masses between 10 and 24 solar masses 
reveal their presence by the motion of companion stars due to the 
existence of a strong gravitational field that cannot be explained other- 
wise. Another indication of the presence of a black hole is radiation 
emitted from matter falling into the black hole under its tremendous 
gravitational pull. 

Supermassive black holes, millions (or billions) times more massive 
than the Sun, are thought to be in the centres of many galaxies, including 
our own Milky Way. They are detected by the motion of nearby stars 
and gas. 

Recently, evidence for intermediate-mass black holes has been ob- 
tained by powerful new telescopes, such as Chandra, XMM-Newton, 
and Hubble. 

There is an abundant literature on the subject in relevant scientific 
journals and on the Internet. 


VI.8.2 The interiors of black holes 


We have seen that the representation of the interior of a Schwarzschild 
black hole by a Lorentzian manifold ceases to be meaningful at the sin- 
21 Although not so short for a very gularity r = 0, attained in a very short time?! by any infalling object. 
large me M 7 since it bis aa The representation of the singularity in the Kruskal diagram by a space- 
P ar M Mines S ASAE like submanifold is misleading. The interior of the Kerr black hole and 
ii its singularity are very strange. They cannot be considered as Einstein- 
ian spacetimes in any classical sense. The general belief is that a new 
theory of gravitation must be constructed to represent them. This is a 
subject of very active research, linked with the old problem of the quant- 
ization of the gravitational field. Previous research in this difficult field 
has known dramatic transformations through the introduction of new 
and fascinating ideas, in particular string theory. However, further dis- 
cussions of black holes and quantum gravity are outside the scope of this 
book. 


VI.9 Solution of Exercise VI.3.1 


(1) With t now denoted by p and r by 7, the Schwarzschild metric reads 
2m pre 2m 2 22/992 4 ain? 2 
-| — -1 dr“ + | — — 1 ]| dp? + T“ (d0 + sinf 0 d¢*), 
T 


It has Lorentzian signature for 0 < T < 2m, with 7 a timelike co- 
ordinate. For any such given 7, the space metric is defined on the 
product R x 9?; it is degenerate at r = 2m and singular for T = 0. 


(2) The definition 


v=tt+r+2mlog (— z 1) , (VL9.1) 
2m 
i.e. 
md d 
dediti t — A, sge 
r — 2m r — 2m 


implies by a simple computation the announced result: 


2m 3 2m\ ! 2 2m 2 
1 dt" + {1 dr“ = | 1 — — | (dv* + 2 dvdr). 
r r r 


(3) The metric is Lorentzian because 


2dvdr = ! [(dv + dr)? — (dv ar)’ ; 


VI.10 Problems 


VI.10.1 Lemaître coordinates 


As was done by Lemaitre in 1933, consider coordinates 7, p, 0, ġ given in 
terms of the standard Schwarzschild coordinates t,r,0,ġ on R* by the 
functions 


V2m — yr 
T := t+ 2V 2mr + 2m log | —W—}, 
V2m+ yr 
2 
3/2 
T= rl —__. 
P 3v 2m 


1. Show that the coordinates change is singular for r = 2m. 


2. Show that in the Lemaître coordinates, the Schwarzschild metric takes 
a form that is singular only for r = 0, but non-static: 


2 
—dr? + do? + r?(d0? + sin? 0 dd?). 
r 


3. Determine the radial geodesics. 


Solution 


1. The logarithm tends to —oo when r tends to 2m. 
2. This is a straightforward computation. 


3. The radial geodesics are lines where only 7 varies. 
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Problems 


151 


152 Black holes 


22 Christodoulou and Ruffini had shown 
in 1971 (see Ohanian and Ruffini, 2013) 
that the irreducible mass increases in 
any Penrose problem, proving thus in a 
particular case the Hawking area the- 
orem. For the general case, see Hawking 
and Ellis (1973). 


VI.10.2 Reissner-Nordström black hole 


Discuss for which values of the constants m and Q the Reissner— 
Nordström metric (see Chapter V) 


2n Q? 7 2m Q? z > 
= 1 dt“ + {1 
a ( r'r ) ( rra) M an 


+ r?(d6? + sin? 6 dy”), 


with electromagnetic potential A; = 0, Ag = —Q/r describes a black 
hole and study its event horizon. 


Solution 
The metric grn is a smooth Lorentzian metric for all r Æ 0 if 
r? —Imr + Q? > 0, 
that is, m? < Q?, since the roots of this polynomial in r, 
re =m+t Vm? — Q?, (VI.10.2) 


are then imaginary. For Q? < m?, the metric becomes singular in the 
standard coordinates on the two manifolds r = r} and r = r_. The 
manifold r = r+ is the event horizon. 


VI.10.3 Kerr—Newman metric 


The Kerr-Newman metric reads 


2mr 2 sin? 0 A 2 
JKN = (1 A ) dt“ —4mra A dt dọ + Brg” 


2mra? sin? 0 
+ Ade? + (e say om) sin?9d¢?,  (VLL2) 
with a and m constants and 
A=r’+a’cos?6, B=r?-2mr+a’. (VI.1.3) 


Check that gxn coincides with the Kerr metric in Boyer—Lindquist 
coordinates if Q = 0, and with the Reissner—Nordstr6m metric if a = 0. 

Show that gxwn satisfies the Einstein equations with electromagnetic 
source. 


VI.10.4 Irreducible mass (Christodoulou—Ruffini) 


Assume the Hawking area theorem,?? which says that no physical process 
can make the area of the event horizon of a black hole decrease. Show 
that the energy loss in a Penrose process in Kerr spacetime is at best 
50%. 
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Solution 


The radius of the event horizon of a Kerr spacetime with parameters m 
and a is 


re =m+my1- ]|Jl?, 


where J denotes the spin of the black hole. 

The event horizon of the static spacetime resulting from an optimal 
(i.e. absorbing all the rotational energy) Christodoulou-Ruffini-Penrose 
process with mass m’ is r = 2m’. The Hawking area theorem implies 


1 
m > gm + v1- |J]2). 
The gain in mass is 


1 1 


m ms 5m 1 < =m. 
eller 


It tends to m/2 when |.J| tends to its maximum value 1. 


VI.10.5 The Riemannian Penrose inequality 


We have given in Chapter V the Schwarzschild metric in isotropic 
coordinates, 


2R—m : m \4 
= dt? + (14 dX?+dY?4+dZ? 1.10. 


and have shown in Chapter IV that it has ADM mass mapm = mM. 


Prove that 
| A 
m= \/—, 
167 


where A is the area of the horizon. 


Solution 
The horizon, deduced from the expression for the spacetime metric, is 


m 
Rs 
2 


Straightforward classical calculus gives the area of this 2-sphere: 


2 
=| 2* dX dY dZ = T 24r = 160m’. 
Bmj2 
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23For details, see Bray and Chruściel 
(2004) and references therein. 


Comment. The above equalities and the Beckenstein-Hawking area 
theorem led Penrose to conjecture that general black holes should satisfy 
an inequality of the form 


A 

m <4/—. 

~ V 167 

This inequality considering only the space manifold, then called the ‘Rie- 
mannian Penrose inequality’, has been proved by Huisken and Ilmanen 
under the hypotheses that the space manifold is a Riemannian asymptot- 
ically Euclidean manifold with non-negative scalar curvature that admits 
a compact boundary composed of minimal surfaces but no other min- 
imal surface.?? The considered area A is the area of any disconnected 
such surface. The relevance to physics is that such boundary surfaces are 
‘trapped surfaces’ conjectured to be linked with black holes containing 
a singularity of the corresponding spacetime. 


Introduction to cosmology 


VII.1 Introduction 


A cosmological model is a spacetime that is supposed to represent the 
whole past, present, and future of our universe. It has long been argued 
that in the framework of General Relativity, gravitation should model 
the geometry of the cosmos because its forces are long range (in contrast 
to nuclear forces) and non-compensated (there are no negative masses— 
in contrast to the existence of positive and negative electric charges). 
Since Einstein’s formulation of his equations, it was assumed by most 
cosmologists that the cosmos in which we live is a Lorentzian 4-manifold 
(V,g) that satisfies the Einstein equations with the non-gravitational 
energies as sources. In the beginning, cosmological models relied essen- 
tially on a priori conjectures based on philosophical prejudices, the aim 
of simplicity, and the desire for unification. 

Because of considerable progress in astronomical observations by both 
Earth-based telescopes and satellites,! cosmology has become a fully 
fledged part of physics. There is a wealth of information that has been 
accumulated in recent years from observations of the cosmos and that 
needs to be analysed. There have been remarkable advances in our know- 
ledge, but also new and puzzling questions. We cannot in this book enter 
into a detailed exposition of the amazing number of observations made 
since the beginning of this century, and we can only give some indication 
of how remarkably precise conclusions, often convincing but sometimes 
speculative, have been deduced from these observations. 

We will in this chapter give the main facts that led a majority of cos- 
mologists (although there are still dissidents) to believe that our cosmos 
is represented by a four-dimensional Robertson—Walker spacetime (see 
Section VII.4) solution of the Einstein equations with source all the 
non-gravitational energies. Radiation energy is today negligible with re- 
spect to matter energy, but is thought to have been dominant at early 
stages. The matter source was considered until fairly recently as a perfect 
fluid, whose particles would be galaxies or clusters of galaxies. Obser- 
vations made at various scales have now shown that the usual matter 
sources represent only a quite small fraction, an estimated 4.5%, of the 
energy content of the universe. Modern cosmologists have introduced 
new types of sources: dark matter (about 25%) and dark energy (about 
70%). There are numerous physical conjectures regarding the nature of 
these sources. A possible interpretation of dark matter is the existence 
of WIMPS (weakly interacting massive particles), particles that interact 
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1Such as COBE (Cosmic Background 


Explorer), 


and WMAP (Wilkinson 


Microwave Anisotropy Probe), and the 
information gathered by the Planck 
satellite (analysed and published in 
March 2013). 


?Dynamical properties of celestial bod- 
ies, for instance the flat aspect of some 
galaxies, cannot be explained by their 
‘normal’ mass content. 
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3Though not quite by all of them— 
there are still adherents of the steady- 
state and continuous-creation theory of 
Gold, Hoyle, and Narlikar and also of 
I. E. Segal’s theory of chronogeometry 
(Segal, 1976). 


4See, for instance, for results and ref- 
erences prior to 2002, Cotsakis and 
Papantonopoulos (2002) and for refer- 
ences prior to 2010, Cheng (2010). Most 
recent observations can be found in 
papers on the arXiv (gr-qc or astro-ph). 


5QOlbers pointed out in 1823 that if 
space was infinite and filled with a 
homogeneous distribution of stars, the 
sky would appear uniformly bright— 
in clear contradiction with the dark 
night sky. Historians of astronomy have 
found that Olbers was not the first to 
make this remark, but it is his name 
that is still attached to it. 


6 Another obstacle to the acceptance of 
the Einstein static universe as a model 
for our cosmos is that it is unstable. 


7 An idea Einstein did not like, because 
it introduces a new parameter, a priori 
not geometrically defined. 


8These properties have been used by 
Penrose to give a geometric definition 
of asymptotic flatness; see, for instance, 
Hawking and Ellis (1973) or Section 6 
in Appendix VI of YCB-OUP2009. 


only through the weak field and gravitation. Dark energy is often in- 
terpreted as an energy of the vacuum, represented by a cosmological 
constant, or possibly a scalar field, called quintessence. 

In Section VII.7, we give a brief description of what is currently 
thought by most cosmologists® to be the birth and infancy of the cosmos 
in which we live.* 


VIIL.2 The first cosmological models 


Soon after the Einstein equations were first framed, models were 
proposed for the whole cosmos. 


VII.2.1 Einstein static universe 


Historically, the first cosmological model was found in 1917 by Ein- 
stein himself, looking for a solution that was homogeneous and eternal, 
would solve Olbers’ paradox,” and would be in agreement with Mach’s 
principle. He was thus led to the Einstein static universe (see Prob- 
lem IV.11.1) with closed space manifold $, and spacetime manifold 
S? x R endowed with the static metric 


— dt? + agys, (VII.2.1) 


with ag a constant and 7, the metric of the unit 3-sphere S°, that is, in 
the usual angular coordinates, 
y+ := d0? + sin? 6 (dd? + sin? (dy?) 
Setting sin 0 = r gives the form familiar to geometers: 
dr? 
l-r? 
We have seen that the Einstein static universe is a solution of the Ein- 


stein equations with static perfect fluid source of constant density and 
pressure given by 


y+ = +r? (d? + sin? ¢ dy”). (VII.2.2) 


Ho = 349°, Po= (VII.2.3) 
A negative pressure is unacceptable on classical physical grounds.® 
This difficulty was remedied by Einstein by introducing a positive 
cosmological constant.’ The formulas (2.3) are then replaced by 


(VII.2.4) 


Einstein choose A = ag 2 so that po = 0. 

The Einstein static spacetime, also called the Einstein cylinder, is 
(but with a different interpretation) the arena of the Segal cosmos. 
The manifold $ x R is the universal cover of compactified Minkowski 
spacetime; it is conformally locally flat, and the conformal group acts 


on this geometric, non-metric, structure. The fact that representations 
of the conformal group govern the world of elementary particles seemed 
to I. E. Segal a justification of his cosmological model. He explains the 
cosmological redshift by the existence of a local time different from the 
cosmic time. 

A few years after introducing the cosmological constant (in order to 
obtain a physically realistic time-independent cosmos), Einstein aban- 
doned it (calling it the greatest blunder of his life) and accepted 
time-dependent models, motivated by the observation by astronomers 
of the redshift of light coming from distant stars, interpreted as being 
due to the expansion of the universe.® 


VII.2.2 de Sitter spacetime 


The de Sitter spacetime (see Problem IV.11.2) was constructed in 1917 
by Willem de Sitter. It is a solution of the four-dimensional Einstein 
equations with positive cosmological constant. It is not realistic, because 
it is a vacuum solution, but was the first example of a time-dependent, 
expanding, Einsteinian spacetime. 

Einstein and de Sitter had long discussions about their respective 
models. These discussions, and the discovery of the expansion of the 
universe, led Einstein to abandon the cosmological constant, but accept 
time-dependent spacetimes. 


VII.2.3 General models 


In the classical models, still basic for cosmological studies, the cosmos 
is modelled by a four-dimensional Einsteinian spacetime. Nowadays, all 
cosmologists consider, as it is experienced by everybody in daily life, 
that there is a flow of time—that is, a past and a future. They speak of 
the universe as the cosmos at a given time. Note that we do not observe 
the present state of the cosmos, since information comes to us at most 
with the speed of light. Most cosmologists now accept that the universe 
is expanding—indeed that the expansion is accelerating, according to 
recent observational data. 


VII.3 Cosmological principle 


VII.3.1 Assumptions 


The standard cosmological models are based on the so called 
cosmological principle, which is composed of two assumptions. 

The first assumption is that the cosmos is a manifold M x R endowed 
with a Lorentzian metric, four-dimensional in the usual case, such that 
the lines x x R, x € M, called trajectories of the ‘fundamental obser- 
vers’, are timelike geodesics orthogonal to the manifold M x {t} at each 
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Recent observations indicate that the 
cosmological expansion is accelerating. 
This has led cosmologists to reintro- 
duce a cosmological term. 
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10That is, for each pair of points x and 
y in Mr, there exists an isometry of 
(M, g) that brings z to y. 


1 First found by Penzias and Wilson in 
1964, but predicted earlier by Gamov 
from the big bang theory. 


12A black body is one that absorbs all 
radiation. Contrary to its name, a black 
body emits radiation when heated, but 
the graph of the emitted energy in 
terms of wavelength (the black body 
spectrum) is a roughly Gaussian-type 
curve with a peak depending on the 
temperature. 


13Planck satellite in 2013. 


14There is no real difficulty in ex- 
tending the definition to n + 1 di- 
mensions, although the space Riemann 
tensor is then no longer equivalent 
to the Ricci tensor. Note also that 
the topological classification of three- 
dimensional manifolds does not extend 
to higher dimensions. 


point (x,t) € M x R. The Lorentzian metric of the cosmos can therefore 
be written as 


(gq = —dt? + Bg. 


The Riemannian manifold (M,®? g) is the universe at time t. The proper 
time of fundamental observers is called cosmic time. 

The second assumption of the cosmological principle is that the uni- 
verse at each time should look the same in all directions and also to 
any fundamental observer. This assumption is called the Copernican 
principle—in the name of Copernicus, who deprived us of a central pos- 
ition in the Solar System. We also know that the Sun does not occupy a 
remarkable place in our galaxy. The mathematical content of the second 
assumption, modulo the first one, is as follows: 


(1) Isotropy: the Riemann tensor of the space metric “)g is at each 
point x invariant under rotation in the tangent space to M centred 
at x. 


(2) The space metric is homogeneous, i.e. it admits a transitive? 
group of (global) isometries. 


VII.3.2 Observational support 


We see the stars very unevenly distributed in the night sky. Galaxies, 
and even clusters of galaxies, are also observed in our telescopes as being 
very anisotropically and inhomogeneously distributed. An argument in 
favour of adopting the cosmological principle is that, at a still larger 
scale, isotropy and homogeneity seem to be attained. 

For most astrophysicists, the strongest evidence for the validity of the 
cosmological principle is the isotropy of the CMB (cosmic microwave 
background) radiation. This is the faint background glow! that sen- 
sitive radio telescopes detect in the sky, which otherwise appears dark 
between stars and galaxies to traditional optical telescopes. The CMB 
radiation is measured to be very nearly isotropic, with a temperature of 
about 2.725 K and a black body’? spectrum. An observed discrepancy 
with isotropy of order 107° is interpreted as being due to our own mo- 
tion relative to fundamental observers. There are further anisotropies of 
the order of 107°: the most recently observed? anisotropic structures at 
this scale seem to be best explained by random inhomogeneities in the 
big bang. 

An argument in favour of homogeneity is that the fundamental phys- 
ical constants seem to be and have been the same throughout the cosmos 
with a remarkable accuracy. 


VII.4 Robertson—Walker spacetimes 


The Robertson—Walker spacetimes are (3 + 1)-dimensional'+ models 
satisfying the assumptions of the cosmological principle; that is, they 


VII.  Robertson—Walker spacetimes 


are represented by metrics that read, on a product M x R with Ma 
three-dimensional manifold, 


(g = -dt +P) 9, (VII.4.1) 


where ®)g is a t-dependent Riemannian metric on M whose Riemannian 
curvature is isotropic at each point of M. We will see that this implies 
that the metric )g is homogeneous. 


VII.4.1 Robertson—Walker universes, metric at given t 


The curvature tensor of a Riemannian metric g at a point x € M is 
isotropic at x, that is, invariant under rotations in the tangent space 
to M at x, if and only if it is of the form 


Rij,nk(@) = K(x)(gih jk — Jin Jik) (2). (VII.4.2) 
Exercise VII.4.1 Prove the ‘if’ part of this property. 


Since in dimension 3 the Riemann tensor is equivalent to the Ricci 
tensor (4.2) can be replaced in that case by (recall that Sj; = Ri;— igi R 
denotes the Einstein tensor) 


OR; =2K Og, hence ©Sy = -KO gz. (VII.4.3) 
The contracted Bianchi identity implies 
Oy Osi =0, ie. 3K =0. 


Hence, K is a constant; the isotropic metric “)g is also homogeneous. 
Riemannian spaces with curvature of the form (4.2) with K a constant 
are called spaces of constant curvature. 

To determine the general Riemannian spacetimes (M,®) g) of constant 
curvature, it is convenient to use, in a neighbourhood of an arbitrar- 
ily chosen point, polar pseudo-coordinates centred at that point, in 
which the spherical symmetry resulting from the cosmological principle 
is manifest. In these coordinates, the metric (see Chapter V) takes the 
form 


(3) = e” dr? + r?(d0? + sin? 0 do?), with p= plr). (VII.4.4) 


Equations (4.2) then read (the prime denotes the derivative with respect 
to r) 


O) Rip =0, i£ k, O) Rii =r tp =2Ket, (VII.4.4) 


1 6) 
—— Ra = |) Rog = -e # $14 ce "pu =2Kr?,  (VIL4.5) 
sin* 0 2 
The general solution of (4.4) is trivially found to be e 4 = 
—Kr?+constant, and (4.5) then gives 


e#=1- Kr’. (VII.4.6) 
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15Tn an old terminology, universes sup- 
ported by compact manifolds were 
called ‘closed’ (recall that, unless other- 
wise specified, manifolds are without 
boundary). Those with non-compact 
support were called ‘open’. 


16When e€ Æ 0, a(t) is equal to |K|-2, 
K being the constant-in-space, time- 
dependent, scalar Riemann curvature 
of ©)g. 


We already know that a metric of constant curvature K = 0 is locally 
flat. 

A metric with constant curvature K > 0 [respectively K < 0] is lo- 
cally a 3-sphere of radius K 2 [respectively locally a hyperbolic 3-space 
of radius |K|~2]. In these cases, one classically scales r by setting 
r= |K|~2?7 and relabels 7 as r. In the new coordinate r, the metric 
takes one or other of the standard forms according to the sign of K: 

(3), — —1 _ dr? Qos 2 2 . 

g = |K| ye, w= z t+r(sin’ 0 dọ +d"), e= sign K. 
— Er 
(VIL.4.7) 
The metric y- can be transformed into the familiar forms of the unit 
sphere or pseudosphere metrics as follows: 


when £ = 0, r = x, we have 


yo = dx? + x? (sin? 6 do? + dé”); (VII.4.8) 
when £ = 1, r = sin x, we have 
y4 = dy? + sin? y (sin? 6 de? + dé”); (VII.4.9) 
when £ = —1, r = sinh y, we have 
y- = dx? + sinh? x (sin? 6 do? + d6?). (VII.4.10) 


The isotropic homogeneous simply connected Riemannian manifold 
corresponding to £ = 1 with 0 < x < m and 0, ¢ angular coordinates on 
S$”, is the sphere $*, a compact Riemannian manifold.!° 

The metrics with £ = 0 or € = —1 can both be supported by the non- 
compact manifold R°, with coordinate 0 < y < oo. The corresponding 
isotropic, homogeneous, simply connected Riemannian manifolds are the 
Euclidean space E? or the hyperbolic 3-space H3. 

We have computed Robertson—Walker spaces supported by simply 
connected manifolds by looking for globally spherically symmetric met- 
rics around one point. There are other metrics that are isotropic and 
homogeneous but not globally spherically symmetric, which can be ob- 
tained as quotients of the previous metrics by an isometry group. They 
are supported by manifolds with different topologies. An example in the 
case € = 0 is the flat 3-torus. 


VII.4.2 Robertson—Walker cosmologies 


From Section VII.4.1, there are three types of Robertson—Walker space- 
time metrics: 


— dt? + a? (t)ye, ¢€=1,—-1, or 0, (VII.4.11) 


where a is an arbitrary function’® of t. We see that a(t) is a scaling fac- 
tor of local spatial distances of fundamental observers. In an expanding 
universe, (i.e. if à := da/dt > 0), this distance increases proportionally 
to a and to the original distance, as in an inflated balloon. Indeed, on 
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the trajectory of a fundamental observer, only t varies; the space dis- 
tance at time t of two fundamental observers moving respectively on the 
timelines x = x and z = gı is the product by a(t) of the distance in 
the metric ye between these two points of M. 


VII.5 General properties 
of Robertson—Walker spacetimes 


VII.5.1 Cosmological redshift 


The first essential cosmological data came from the observation of the 
redshifts of stars and galaxies. The redshift parameter is defined to be 
z=% L], (VIL5.1) 
Vo 
where vo is the observed frequency and vg the emitted frequency. There 
is a shift towards the red [respectively towards the blue] if z > 0 
[respectively z < 0]. We now prove the following proposition. 


Proposition VII.5.1 In an expanding Robertson—Walker spacetime, 
there is a cosmological redshift with parameter given approximately by 


t 
tage eee 0) 


(VII.5.2) 


if we assume that the variation of a is negligible during a period of the 
emitted and a period of the received light signal. 


Proof. Recall that a Robertson—Walker metric is of the form 
— dt? + a° (t) [dx? + f?(x)(d6? + sin? 0 d¢”)) . (VIL.5.3) 


Let Oo and Og be two fundamental observers. Take coordinates such 
that they have the same 0 and @ and have y coordinates 0 and yg, 
respectively. An observer O with space coordinate yo = 0 (this is no 
restriction) and t = to receives at time to light emitted at time ts < to 
by a source S' with space coordinate y(ts) = xs > 0 ifit is on the light 
ray joining these two spacetime points. This light ray is the solution of 
the differential equation 


dt 
a a(t) (VIL.5.4) 
given by 
ta di 
= ——, VII.5.5a 
s= (VIL.5.5a) 


Photons emitted by Og situated at y, at times ts and ts + Ts follow 
null rays and arrive at Op situated at yo = 0 at times tọ and to + To 
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17A statistical treatment is necessary 
because an individual star, even an in- 
dividual galaxy or cluster of galaxies, 
cannot be considered as a fundamental 
observer. 


18This again is a statistical statement 
with regard to distances of fundamen- 
tal observers emitting and receiving 
signals. 


that satisfy 


to dt tot+To dt 
J —— = / —_. (VII.5.6) 
ts a(t) ts+Ts a(t) 
Elementary calculus shows that the times Tg and To are therefore related 
by 
ts+Ts dt to+To dt 
/ —— = f —_. (VIL.5.7) 
ts a(t) Jio a(t) 


Hence, if a is considered as constant during the small amounts of time 
Ts (period of the emission) and To (period of the reception), we obtain 


Ts _ a(ts) 
— = : II.5. 
To a(to) ee 
Since the frequency is the inverse of the period, this result leads to 
t 
vs _ alto) (VIL5.9) 
vo alts) 
which is the desired relation (5.2). a 


A positive cosmological redshift, vo < vg signals an expansion of the 
universe, a(t) > a(ts). 

Statistical observations!” of distant galaxies confirm, if we analyse 
them in the framework of a Robertson—Walker spacetime, that our uni- 
verse is at present expanding, a’(to) > 0. Over the last few years, it has 
been observed that the expansion is accelerating: a” (to) > 0. 


VII.5.2 The Hubble law 


In a Robertson—Walker spacetime, the distance between two given fun- 
damental observers at some cosmic time t is proportional to a(t). The 
Hubble parameter is defined to be the constant-in-space, t-dependent 
scalar 


Hea a, (VII.5.10) 


It measures the rate of expansion (or possibly contraction) of the 
universe. It has dimension (time)~!. An expanding Robertson—Walker 
universe has a positive Hubble parameter. 

The Hubble law says that the observed redshift is proportional to the 
distance of the source.'® It is true as consequence of the theory only in 
first approximation for not too distant sources, as we now show. 

Approximate computation of the redshift gives 


vs . a(to) 


1+z:= = : 
vo a(tg) 


(VII.5.11) 


q z (to) (to — ts) 
vo a(ts) a(ts) 
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Recall that 


Xs = f i ra = (to — ts)a™t (ts), (VIL.5.12a) 


S 


and hence, with ds = a(to)xs, 


o (to — ts) = H (to)ds. 

a(ts) 

For a given observer at a given time, the redshift is, in rough approxi- 
mation, proportional to the distance of the source. 

The Hubble law has long been used by astronomers to estimate dis- 
tances from the Earth of various astronomical objects that are too far 
away for these distances to be measured by triangulation. It is then not 
logical to use these redshifts to estimate distances. Fortunately, distances 
can also be estimated by comparing the luminosity of a star as observed 
from Earth (the ‘apparent luminosity’) with the emitted luminosity (the 
‘absolute luminosity’) as calculated from the physical nature of the star. 
In the past, cepheid variables (whose absolute luminosity is related to 
their period of variation) were used as such ‘standard candles’, but it was 
discovered that there are two kinds of cepheids. More reliable ‘standard 
candles’ have been found by astronomers, particular so-called type Ia 
supernovae. !9 


VII.5.3 Deceleration parameter 


A second approximation of the relation between redshift and distance 
is obtained by introducing the deceleration parameter linked with 
the second derivative of the scale factor a. This is a dimensionless sca- 
lar that measures the rate of variation with time of the expansion (or 
contraction), and is defined by 


aa” 


q= . 
a!2 


(VII.5.13) 
An expanding accelerated universe has a negative deceleration param- 


eter. 
Taylor formula gives 


a(ts) = a(to) h + (ts — to)H (to) — sat? (ts —to)?+.. | (VII.5.14) 


and 
xs = a~t (to) [to ts + — (to — ts) +.. | : (VII.5.15) 
which gives the next approximation, 
z = H(to)dg + Zt + 1)H(to)d3 +... . (VII.5.16) 
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19Supernovae of type Ia have weak 
hydrogen lines and a strong Si line at 
6150 A. 
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20 Although there were, and still are, 
some cosmologists who believe in an 
eternal universe. 


VII.5.4 Age and future of the universe 


It is generally believed that the universe started with a big bang,?? 
the structure of which is a subject of active debate (see Section VII.7). 
We denote by t = 0 the time at which it is believed to have started 
expanding as a Robertson—Walker universe. We set a(0) = ao. We denote 
by tp the present time, that is, the age of the universe. By definition, 
a(tg) > 0; also, a'(to) > 0, since we observe (statistically) redshifts, not 
blueshifts—the universe is presently expanding. 

Until near the end of the twentieth centuries, it was thought for clas- 
sical physical reasons that the expansion of the universe has been, is, 
and will be always slowing down—that is, a(t) < 0 for all t > 0, and 
the curve is concave-downwards. The universe would end in a big crunch 
at a time t = T < oo. In fact, more recent observations have led to the 
belief that the expansion of the universe is now accelerating a” (to) > 0. 
Estimates of to and the future of the universe depend on the properties of 
the function a(t), that is, of the qualitative and quantitative properties 
of sources. 


VII.6 Friedmann—Lemaitre universes 


VII.6.1 Equations 


A Friedmann—Lemaitre cosmos is a Robertson—Walker spacetime whose 
metric satisfies the equations 


1 
Sap = — Agag + Top, with Sop = Rag — geek, (VIL.6.1) 
with Tag such that 
Too = #, Tyy=pa'yg, Ty =0. 


The non-zero Christoffel symbols of the spacetime metric (4.11), 
—dt? + a?y-, are computed to be 


d 
ry; =aa'yij;, Th, =a tað, Iin = Üh With a := aa (VII.6.2) 
where 7% p are the Christoffel symbols of the relevant metric ye. 

The non-zero components of the Ricci tensor of the spacetime metric 
are, using the value rj; = 2eyij for the Ricci tensor of the spatial metric 
Yes 

Roo = —3a7*a", Rij = (2e + aa” +207) yiz a” := > (VIL6.3) 
The scalar curvature is 


= —Roo + a?” Rij = 307? (2e + 2aa” + 2a”). 
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Hence the component Soo of the Einstein tensor reads as a first-order 
differential operator on a: 


1 = 
Soo = Roo 5 ook = 3a (e +a”) 


The corresponding Einstein equation?! gives u in terms of a, a’, and A; 
it is called the Friedmann equation: 


u= Soo — A = 3a7? (e +a?) — A. (VII.6.4) 
The equation 
YÍ Si; = Tij = 3A = 3pa? 


gives p in terms of a, a’, and a”, namely 


p= a? (e +a’? + 2aa") +A. (VII.6.5) 
Exercise VII.6.1 Prove (6.4) and (6.5). 


Exercise VII.6.2 Using the formulas obtained for u and p, prove that 
the stress—energy tensor satisfies the conservation laws, which read, since 
A is a constant, 


Vere’ =0. 


VII.6.2 Density parameter 


In discussing cosmological observations, it is useful to replace the dens- 
ity function u by a dimensionless quantity Q, called the ‘density 
parameter’, which is defined by 


oa 
3H? 


where H := a™ ta’ is the Hubble parameter. Note that Q depends only 
on cosmic time. The Friedmann equation (6.4) reads 


(VIL6.6) 


3 (a ’e + H?) =p+A, (VII.6.7) 
that is, 


Q=1+ (VIL.6.8) 


E 
a? H?’ 

If our universe is modelled by a Robertson—Walker cosmology, then 
the value of Q determines the type of Robertson—Walker spacetime in 
which we live: Q > 1 implies that £ = 1, with closed space sections; Q < 1 
implies € = —1, with the spatial sections being open if they are simply 
connected. The critical case Q = 1 implies € = 0, and hence locally flat 
space (Euclidean space if simply connected). Physicists argue that in the 
models we are studying, if Q > 1 by an appreciable amount at an early 
time, the universe will subsequently collapse in an extremely short time. 
If Q < 1, it will expand so fast that no stars could form. Thus one must 


211t is a constraint on a and a’, as pre- 
dicted from a general analysis of the 
Einstein equations. 
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suppose that was initially very close to 1; this is called the flatness 
problem. 


VII.6.3 Einstein—de Sitter universe 


At the present cosmic time, p is negligible with respect to p. 

The Einstein-de Sitter universe (found by Einstein and de Sitter 
in 1932) is a Friedmann—Lemaitre universe such that p = 0 (dust model), 
A = 0, and € = 0 (flat case). Then a satisfies the second-order differential 
equation 


a’ + 2aa” = 0, (VII.6.9) 
which can be integrated easily. Indeed, it can be written as 
n 1 / 
2aa” + a? = dad! (© | z) = 0, 
a 2a 
which is equivalent to 
d Jad 
J In(a’a2) = 0. (VIL.6.10) 


Hence, if a(0) = 0, it holds that 


2 = kt, k a constant. 


a 
By setting x := kr, the metric reads 
— dt? + k°t8 [dr? + r°(d0? + sin? 0 dd?)] . (VII.6.11) 


Exercise VII.6.3 Find this metric by using (6.5) and (6.6). 


VII.6.4 General models with p = 0 


In the general case with p = 0, one can use the Friedmann equation 


p= 307? (e+) — A (VII.6.12) 
and the conservation law 
d 3 
— =0 
gee) 
to obtain 
-1O + a?A 
a? = fle) = ae =e. Gsm aconstant. (VIL6.13) 


Hence one has the differential equation 


dt 1 


da \ f(a)’ 


for the function a ++ t(a), whose inverse gives the variation of a in 
function of t. 


In fact, it is now possible with a good approximation to deduce dens- 
ity from observations by a dynamical analysis, and the distance ay by 
using standard candles. Analysis of the redshift then gives the Hubble 
parameter H and the deceleration parameter q, and hence a’ and a”. 
It is found that today a’ is positive (expansion) and also a” > 0 (ac- 
celeration). The belief among most cosmologists is that at present the 
universe is in accelerated expansion. 


VII.6.5 ACDM cosmological model 


The standard model of the universe adopted at present by most cosmolo- 
gists is, up to small corrections, a Friedmann—Lemaitre universe; 
that is, a Robertson—Walker spacetime solution of the Einstein equa- 
tions with cosmological constant and source a stress—energy tensor of 
matter, with negligible contribution from radiation and neutrinos (even 
if the latter have a tiny non-zero mass). Observations of the motions of 
stars and galaxies using powerful Earth-based and satellite telescopes 
seem to imply that at a cosmological scale classical baryonic matter?? 
is a very small part of the energy content of the cosmos, around 4.5%. 
The observations indicate the existence of cold dark matter, estimated 
at about 25% of the universe energy content and probably composed of 
WIMPS (weakly interacting massive patricles) with only weak and gravi- 
tational interactions. The acceleration of the universe is explained in the 
ACDM model by the existence of dark energy represented by the term 
Ag, usually considered as an energy of the vacuum, of quantum origin 
and constant in spacetime, in agreement with the observational result of 
the constancy throughout spacetime of the fundamental dimensionless 
parameters, in particular the fine-structure constant (see Chapter IV). 


Remark VII.6.1 The expansion history for the ACDM cosmological 
model is presently obtained from the supernova distance-redshift relation 
deduced from observation of type Ia supernovae. It assumes the validity of 
the Friedmann equation. However, as pointed out by some cosmologists, 
the Friedmann equation itself has not been independently tested. 


VII.7 Primordial cosmology 


The general belief is that the cosmos started with a singularity?’ about 
14 x 10° years ago.?* There was a primeval phase?” for which the physics 
was very different from anything we have ever investigated experimen- 
tally. This physics, linked with the search for quantum gravity, is the 
subject of intensive investigation, mainly in the context of string theor- 
ies, with amazing results. In the most popular string theory, the universe 
is an (n+ 1)-dimensional manifold that has n — 3 of its spacelike dimen- 
sions compactified or a very small size (possibly of the order of the 
Planck length) and that is filled with one-dimensional strings. In some 
theories, space and time began to appear only after the Planck time.”6 
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22 Protons, neutrons, and electrons. 


?3The structure of the conjectured ini- 
tial singularity (big bang or oscillatory) 
has led to interesting mathematical 
work but remains a mystery. 


24This estimated age was augmented 
by a few million years after analysis 
in 2013 of data from the Planck sat- 
ellite, which led to a reduction of the 
value of the Hubble parameter (see 
Section VII.5) by about 10%. 


25The duration of this primeval phase 
is estimated as a Planck time, that is, 
about 107445 (see Chapter III). Note 
that giving an estimate for the duration 
of this preliminary stage is somewhat 
contradictory to the fact that time and 
space perhaps did not exist at this 
stage. 
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26Or rather no time less than the 
Planck time 10744 s exists. 


27The transition phase of inflation 
could have been a weakening of the 
inflaton called the slow roll. It is be- 
lieved that damping oscilations were 
probably present (see Damour and 
Mukhanov, 1998). 


28Protons and neutrons. 


29By a process that is not yet fully 
understood, as a result of the exist- 
ence at some time of a slightly greater 
number of baryons than antibaryons, 
most of the latter were annihilated, 
leaving behind an overwhelming pre- 
dominance of matter over antimatter in 
the universe. 


30In the radiation epoch, the photons 
lost energy by interations with elec- 
trons (Thomson scattering) and the 
universe was opaque. After recombin- 
ation and the capture of electrons by 
atoms, the universe became transpar- 
ent. 


For the next earliest phase, the favoured paradigm is that the universe 
(that is, a section t = constant of the Friedmann—Lemaitre model), en- 
tered a phase of quasi-exponential growth, called inflation, driven by an 
energy content approximately equivalent to a large cosmological term, 
a constant or a function of a scalar field, called the inflaton. Inflation 
is generally considered by cosmologists as solving the flatness problem, 
every region of space being stretched almost flat by the expansion. In- 
flation is also considered as solving the ‘horizon problem’, that is, the 
observed similarity in properties (cosmological principle) in regions of 
space so far apart now that no signal emitted near the big bang could 
have reached them both without inflation. 

The inflationary stage (during which the universe is similar to a de Sit- 
ter spacetime) was followed by a transition?’ to a radiation-dominated 
era, during which the universe was filled by an extremely hot plasma 
of quarks, gluons, leptons, photons, and neutrinos. The evolution of the 
universe was governed until a time of the order of 107° seconds by quan- 
tum mechanics, and the physics was more like that which is or soon will 
be accessible to laboratory experiments, albeit under extreme conditions. 

The universe was expanding and cooling; there came periods of 
so-called recombinations and the formation of baryons?® and an- 
tibaryons.?? A later recombination between ions and electrons led to 
the formation of atoms and molecules. The universe became transpar- 
ent.°° The cosmic microwave background (CMB) radiation observed 
now is thought to be made of the free photons surviving from this epoch. 
Finally, stars and galaxies appeared, and the universe as we know it was 
born at a time estimated at about 108 years. 

The modelling of these fascinating early phases is, however, outside 
the scope of this book. 


VII.8 Solution of Exercises VII.6.1 
and VII.6.2 


We need to show that in a Friedmann—Lemaitre universe 


= 3a? (e +a”) — A, (VIL.8.1) 
p= a7? (e +a? + 2aa") +A (VIL8.2) 
Val =0 (VIL8.3) 
Solution 
(VII.8.1): 
Soo = Too — gooA 
gives 


u= Soo — A = 3a? (e +a) — A. 


VIL.9 


(VIL8.2): 


a? 


2 


Si; = Rij Vig R = [2e + aa” + 2a’? — 3(e + aa" + a?)] Yiz 


When n = 3, 774i; = 3, and hence 


V9 Si; =3 (—e — 2aa” a’) = y°T,j — 3A = 3(pa? — A) 
implies 
p= a? (e ta 2aa") +A. 
(VIL8.3): The stress-energy tensor is such that T’° = 0, and hence 
VaT” = 8T" + rR 7s iT 


The non-zero Christoffel symbols of the spacetime metric are 


ry; = aa’y;;, To; = ata}, and Tin = Én Denoting by 
(3)V the covariant derivative in the metric y, we have 


Vol” =0;T"4 rR TI ' ro = VT" = %8; (pa?) = 0, 
because a and p depend only on t. We also have 
Vol = TV +3071 a'T +aa'qij T” = p'+3a71a'(w+p). 
Using the previous expressions for u and p, we find 
g H 
H = 6a ’ad' (e +a?) + 6a~7 aa", 
u+ p= 307? (e +a?) — a7? (e +a? + 2aa") 
sog. (e para aa") f 


and hence 
p + 3a7*a!(u + p) =0. 
Thus 


Val =0. 


VII.9 Problems 
VII.9.1 Isotropic and homogeneous Riemannian 
manifolds 


Isotropy of the space section M, at a point x means that there is no 
privileged direction in the tangent space Ty to M; at x. 


Problems 
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31See, for instance, see CB-DMI, p. 81 
or Bony (2001), p. 61. 


1. Show that if the Riemann tensor of the metric g takes at x the form, 
with K(x) some number, 


Fig nk(®) = K(£)(JihJjk — Jjhgik (£), (VII.9.1) 


it is invariant on the tangent plane at x under rotations centred at x. 


2. Show that the tensor (9.1) is locally homogeneous. 


Solution 


1. The tangent space to an n-dimensional Riemannian manifold at x is 
an n-dimensional Euclidean vector space. A rotation around its origin 
x is represented by an n x n matrix A (called an orthogonal matrix) 
whose elements Aj’ acting on tangent vectors X at z, Yp = Ap Xm, 


are such that 
a m = T 
D= AAA IX 
p p q 


that is, 


XC APAS =0, m#q, 
f (VIL.9.2) 
SAP Af=1, m=q. 
p 
In an orthonormal frame at x, the components of the Riemann tensor 
(9.1) read 
Rijn (£) = K(x) (bindik — Ojndix) (1). 


Under a rotation A, the transformed curvature tensor R’ has compo- 
nents 


Ry we (@) = K(x) Ai, Ad, AR, AL, (Oindje — jn dix) (2); 
that is, using the formulas (9.2), 
Rpj wk (x) => K(x) (din 057K! — Ôjiw irk ) (2). 


VII.9.2 Age of the universe 


1. Use the Taylor formula?! 


a(t) = a(ta) + a! (to)(¢— to) + (t— to)? f (1 —d)a"(ty + A(t — to)) @d 


to find estimates of the age tọ and the time T of the end of an ex- 
panding Robertson—Walker universe when we assume that a” (t) < 0 
for0 <t<T. 


2. What can we say if a” (to) > 0. 


Solution 


1. The Taylor formula implies 
a(0) = a(to) — toa’(to) + B fewa —r))d, O<AK<1. 


Hence, if a(0) = 0, a’(to) > 0, and a” (t) < 0 for t < to, then 


a(to) 
a' (to) 


Suppose that a” (t) < —C, some positive constant for t > to; then 


to 2 = H (to). 
/ 1 3 
a(t) < a(to) + a'(to)(t — to) — 5C(t — to)”, 
and hence a(T) = 0 (big crunch) as soon as 
il 
a(tg) + a’(to)(t — to) — 5Clt — to)? =0; 


that is, 


—a' (to) + \/a’?(to) + 2Ca(to) 
C 


2. The age tọ depends on the behaviour of a(t), and hence of a” (t) for 
0 < t < to. Assume a/(t) > 0 for 0 < t < tı; hence a(tı) > 0 and 
(inflation) a(t) > C > 0 for tı < t < to. Then the Taylor formula 
reads 


Ti 


a(to) = a(t) +a'(ta)(to—) +o)? | (1—A)a" (tı +A(to—t1)) dA, 


and implies 


[Q 


alto) > ~(to — t1)”. 


VII.9.3 Classical Friedmann—Lemaitre universes 


Consider a Friedmann—Lemaitre universe with A = 0 filled with a perfect 
fluid with equation of state p = (y — 1)u, with y a constant. Note that 
y = 1 corresponds to dust and y = 4 to pure radiation. 


1. Show that the velocity of acoustic waves is less than the speed of light 
if y < 2 (see Chapter IX). 

2. Show that in such a Friedmann—Lemaitre universe, it holds that, with 
C a constant, 


pa?’ =C. 


Therefore, the density u must decrease with increasing a. 


VIL.9 
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Solution 


If p = (y — 1)p, the equation (6.5) reads 


© (aS) + 8a"tal(y 1) = 0 
and simplifies to 

u +3a7ta'yu = 0. 
Hence, by integration, with C a constant, 


per =C. 


VII.9.4 Milne universe 


The Milne universe is a Robertson—Walker spacetime with € = —1, 
A = 0, space support R°, and is a vacuum (u = p = 0). The Milne 
universe, or analogous spacetimes with compact (non-simply connected) 
space sections, play an important role as asymptotic states in cosmology. 

Write the spacetime metric of the Milne universe. Show that it is 
locally flat, isometric to a wedge of Minkowski spacetime. 


Solution 


The Friedmann equation implies 


a? =1. 


Hence a’ = 1, and, up to the choice of label of the time origin, a(t) = t. 
The spacetime metric on the manifold R? x (0,00) then takes the form 


—dt? + t?[dr? + sinh? r(d6? + sin? 6 (d@”)], 


where r,0,ġ are polar (pseudo-)coordinates on R°. The space metric 
collapses for t = 0 and expands indefinitely when ¢ tends to infinity. 
The spacetime metric is locally flat, as can be seen by computing its 
Riemann tensor, for instance by using the 3+ 1 decomposition given in 
Chapter VIII. 


Part B 
Advanced topics 


This part provides a deeper study of the properties of general solutions 
of the Einstein equations either in vacuum or with stress-energy— 
momentum sources. Particular attention is paid to relativistic fluids and 
kinetic models, both of which have become important in astrophysics 
and cosmology. 


General Einsteinian 
spacetimes. The Cauchy 
problem 


VIII.1 Introduction 


The Cauchy problem for a partial differential equation of order m on 
R”+! with unknown a function f is the search for a solution such that 
f and its derivatives of order less than m take given values on an n- 


dimensional submanifold M. In coordinates x°, xt, i = 1,2,...,n, with 
M represented by x° = 0, the independent data are 
i ony a 
inet op PS m=] 


In the regular! analytic case, this problem has one and only one analytic 
solution in a neighbourhood of M. 

The stability? of the solution of the Cauchy problem, and its existence 
in the non-analytic case, depend crucially on its characteristic determin- 
ant, the polynomial obtained in replacing in the terms of order m the 
derivative ðf by a vector X. For a linear second-order equation 


of of 
id = + bY =h II.1.1 
 JxrðxP k Ox Pe , a ) 
the characteristic polynomial is the scalar function 

P(X) := a Xa Xg. (VIII.1.2) 


The properties of solutions depend essentially on the signature of this 
quadratic form. In a domain U of R"*", the equation is elliptic if P(X) 
is positive- (or negative-) definite; it is hyperbolic if it is of Lorentzian 
signature. 

For elliptic equations, the solution of the Cauchy problem exists in 
the analytic case, but is not stable: one says that the Cauchy problem is 
not well posed.’ Possibly well-posed problems for elliptic equations are 
global problems, for instance the so-called Dirichlet problem, which 
is the data of the unknown on the boundary of the domain.* 

For hyperbolic equations, the Cauchy problem is well posed in the 
analytic case, but also for more general functional spaces,’ in particu- 
lar smooth functions. For linear equations, the solution is global, i.e. it 
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!The regularity condition for a quasi- 
linear equation with analytic coeffi- 
cients and analytic data on M, repre- 
sented by x? = 0, is that the coefficient 
of the derivative 0 f /(Ax°)™ does not 
vanish for x° = 0. 


*That is, contiguous dependence on 
initial data. 

3A problem is said to be well posed if 
it has one and only one solution and 
this solution depends continuously on 
the data. 


4In this case, the well posedness de- 
pends also on the lower-order terms. 


5An important property for General 
Relativity, since the relativistic causal- 
ity imposes that a solution of a relativ- 
istic equation depends only on the past 
of this point, while analytic functions 
are entirely determined by their values 
in any open set. 
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6Diagonal in the highest-order terms. 


TThe introduction of V permits a glo- 
bal geometric formulation, impossible 
with V for the Einstein equations, 
where g itself is the unknown and sat- 
isfies Vg = 0. 


8In the classical physical case n+1 = 4, 
the number of independent coefficients 
Jap Of a general metric g is 10. 


9Einstein used them in the Minkow- 
skian approximation. It seems that 
de Donder was the first to introduce 
them for the full Einstein equations, 
shortly before G. Darmois. 


10In computations done in local co- 
ordinates (x*), it holds that ôa = 
0/Ox%. 


General Einsteinian spacetimes. The Cauchy problem 


exists on the whole of R"*?, although it may grow, even exponentially, 
with t = x°. Results extend to some (see J. Leray (1953)) quasidiagonal® 
systems and to systems on manifolds. That is, for linear second-order 
systems on a Lorentzian manifold (V"*", g) that read 

(VIIL.1.3) 


Q 


GOV gf + OS Vaf HA =h, 


the Cauchy problem with data on a spacelike section M” is globally 
well posed in relevant functional spaces, with relevant hypotheses on the 
coefficients and Cauchy data. In particular, the solution f? is smooth if 
(V"*1! g) is a smooth and regularly sliced Lorentzian manifold and the 
Cauchy data are smooth. An important property for Relativity is that 
the solution at a point depends only on the values of the initial data in 
the past of this point. 

A quasidiagonal, quasilinear system of second-order differential equa- 
tions on a manifold V”+! reads, in a domain U of local coordinates, 

GOV gf! +051 VF + hf? = hi, (VIIL.1.4) 
where g,b, and c depend on the unknown f and its first derivative. The 
covariant derivative V is taken in an a priori given’ metric g in U. It is 
always possible to take for g in U the flat metric, i.e. V, := 0/Ox%. The 
Cauchy problem for the system (1.4) is well posed for some initial data 
if the linear system obtained by replacing in the coefficients g,b, and c 
the quantities f and Of by their initial data is a well-posed system for 
these initial data. Note that the existence of the solution (which can be 
obtained by iteration), is in general for nonlinear equations only local in 
time. 

The Einstein equations in vacuum are a geometric system for the pair 
(V, g), invariant under diffeomorphisms of the manifold V and the asso- 
ciated isometries of the Lorentzian metric g. These equations constitute, 
from the analyst’s point of view, a system of (n + 1)(n + 2)/2 second- 
order quasilinear partial differential equations for the (n + 1)(n + 2)/2 
coefficients? of the metric in local coordinates. However, the equations 
are not independent, because of the contracted Bianchi identities, a con- 
sequence of the invariance under diffeomorphisms. We will see that the 
Einstein equations have both hyperbolic and elliptic aspects. 


VIIIL.2 Wave coordinates 


The Einstein equations are invariant under diffeomorphisms; to con- 
struct generic Einsteinian spacetimes, one fixes the local coordinates by 
a general condition, called a gauge choice. One then has to prove that 
the constructed spacetime satisfies the gauge condition. 

The results of the Minkowskian approximation have led various au- 
thors, over many years,’ to select in various problems what are now 
called harmonic or wave coordinates, that is, to impose that the scalar 
functions x ++ «® defining local coordinates satisfy the wave equations!° 


Fe -= ge. = gV 0,0" = 0. (VIII.2.1) 


In terms of the Christoffel symbols associated with the coordinates x, 
it holds that 


Fe = g™T$,. (VIII.2.2) 


The following identity results from a straightforward computation: 


Rep = RY faa (VIII.2.3) 
where 
1 
Lop = 5 (garOgF* + 98)0aF*) (VIII.2.4) 


vanish in wave coordinates while the ne are a system of quasilinear, 
quasidiagonal (i.e. linear and diagonal in the principal, second-order, 
terms) wave operators 


1 
h 
Rup = 59" Px9a8 + Pas(g, 89), (VIII.2.5) 


where P is a quadratic form in the components of Og, with coeffi- 
cients that are polynomials in the components of g and its contravariant 
associate. 

The system of partial differential equations 


h) 


R = pap (VIII.2.6) 


are called the (harmonically) reduced Einstein equations (the 
vacuum reduced Einstein equations if pag = 0). 


VIII.2.1 Generalized wave coordinates 


It is clear that one can also deduce from the Ricci tensor a quasidiagonal 
operator on the components of the metric, namely 


1 
REG” = Rag + 5 (GardsH + gaxða H>), (VIII.2.7) 


if the wave coordinate conditions are replaced by the more general ones!! 
Fa :=F°-H*%=0, with F° =gVT?, (VIII.2.8) 
where the H® are known scalar functions. Their advantage is that they 


contain a new set of functions H“, which can be freely specified, or 
eventually chosen to satisfy ad hoc equations. 


VIII.2 Wave coordinates 


11 Friedrich (1986). 
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12Gundlach, Calabrese, Hinder, and 
Martin-Garcia (2005), Pretorius 
(2005b), and Lindblom, Scheel, 
Kidder, Owen, and Rinne (2006). 


13For the Einstein equations with non- 
zero sources, one must solve the cou- 
pled system of these equations and 
the equations satisfied by the sources, 
which depend on their nature. 


14A constructive method inspired by 
representation formulas for solutions of 
linear equations given in dimension 4 
by Sobolev and in even dimension n + 
1 > 4 by de Rham (restricted to con- 
stant coefficients) was used for the first 
proof of existence of solutions of the 
Cauchy problem for quasilinear, qua- 
sidiagonal systems of wave equations 
by Fourés (Choquet)-Bruhat (1952) for 
the case n + 1 = 4 and by Fourès 
(Choquet)-Bruhat (1953) for the case 
n +1 > 4. The Leray theory of gen- 
eral hyperbolic systems with its energy 
method can also be used. The multi- 
plicity of roots of the characteristic 
determinant is irrelevant for the ap- 
plication of Leray theory because the 
characteristic matrix is diagonal. This 
multiplicity destroys the Leray hyper- 
bolicity when the characteristic ma- 
trix cannot be diagonalized into blocks 
with determinants with distinct roots. 
K. O. Friedrichs’s theory of first-order 
symmetric hyperbolic systems has no 
difficulty with multiple characteristics, 
but such first-order systems destroy the 
spacetime appearance of the problem 
and hide the propagation properties of 
the solution. Friedrichs hyperbolic sys- 
tems have no analogue for higher-order 
Leray hyperbolic systems. Moreover 
the weak hyperbolicity of Leray-Ohya 
(see Chapter IX), important for rela- 
tivistic causality, has no analogue in 
Friedrichs’s theory of symmetric hyper- 
bolic first-order systems. 


15For results in classical Sobolev spa- 
ces, see YCB-OUP2009, Appendix III. 


General Einsteinian spacetimes. The Cauchy problem 


VIII.2.2 Damped wave coordinates 


Recently, numerical analysts!? have been led to introduce reduced 
Einstein equations in a wave-related gauge, called damped wave 


coordinates. The operator pon is replaced by 


1 1 
a = RY} j; 5 (9a03H” + 980aH*) + 5 10F i1(98a%a 


+ goang — Jap); 


with n the unit vector normal to spacelike slices and yo a constant. 
The presence of this non-zero constant seems to stabilize the results of 
numerical calculations of the evolution system; it determines the time 
rate at which the gauge conditions FẸ = 0 are damped under evolution. 

The use of a wave gauge gives the basic elements for the proof of fun- 
damental properties of Einstein gravity using known results for partial 
differential equations. 


VIII.3 Evolution in wave gauge 


VIII.3.1 


We consider the vacuum case,!° that is, the search for solutions of a 
system of quasilinear wave equations. The natural well-posed problem, 
called the Cauchy problem, is the construction of a solution taking to- 
gether with its first derivatives given values on a submanifold M of V 
assumed to be spacelike for these initial values. If M has equation x° = 0 
in a local coordinate system and the analytic data are gag(0, x") (a quad- 
ratic form of Lorentzian signature) and (Ogag/0x°)(0,x"), then the 
manifold M is spacelike for these data if the initial data g (the induced 
metric on M with components g;;(0,2")), is positive-definite. Known 
theorems for quasidiagonal quasilinear systems of wave equations’ 
lead to the following theorem, valid in relevant functional spaces,!° 
in particular spaces of smooth functions with some finite number of 
derivatives. 


Solution of the reduced equations in vacuum 


Theorem VIII.3.1 The Cauchy problem for the vacuum Einstein 
equations in wave gauge is well posed; that is, it has one and only one 
local solution depending continuously on the initial data. This solution 
has the physically important property of exhibiting propagation of the 
gravitational field with the speed of light: its value at a spacetime point 
depends only on the data in the past of this point. 


The equations being nonlinear, the solution in general is defined only 
in a neighbourhood of the initial space manifold. The appearance of sin- 
gularities and their nature is a fundamental field of research in General 
Relativity. 
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VIII.3.2 Equations with sources 


In the presence of sources, the reduced Einstein equations contain other 
unknowns in addition to the metric g, namely fields or matter. The 
necessary conservation laws satisfied by the sources, V.7°? = 0, are 
equations that contain both g and these other unknowns; they must be 
coupled with the Einstein equations. The hyperbolicity of this coupled 
system with a dependance domain of a solution of the Cauchy problem 
determined by the null cones of the metric g is important for the sat- 
isfaction of the Einstein principle of causality (see Problems VIII.12.6 
and VIII.12.7 and Chapters IX and X). 


VIII.4 Preservation of the wave gauges 


A solution of the reduced Einstein equations that is a metric in wave 
gauge satisfies the full Einstein equations, and, indeed, for any solution 
of the reduced equations to be a solution of the full equations, it has to 
be in wave gauge, that is satisfy the equations 


F° = g™T$, =0. (VIII.4.1) 


We use the Bianchi identities to show that if g is a solution of the 
reduced Einstein equations'® in a wave (harmonic) gauge, then the 
gauge conditions satisfy a second-order linear quasidiagonal homoge- 
neous differential system, because of the Bianchi identities, which imply 
the following identities for the Einstein tensor S: 


Vas =0. (VIII.4.2) 
Indeed, if g is a solution of the equations in wave gauge 


Rak": (VIII.4.3) 


then it holds that, with T°? := p°? — 4 g°’ p the stress-energy tensor of 
sources, !” 


1 
gee — TP = = uaa + 90, F* — 9° 0,F*). (VIII.4.4) 


An elementary computation using the Bianchi identities and the conser- 
vation laws for the source T shows that, the functions F® then satisfy a 
linear homogeneous system of wave equations 


g°>02 Fo + ABO, F* = 0, (VIII.4.5) 
where the A’s are linear functions in the Christoffel symbols of g. 


Exercise VIII.4.1 Compute the coefficients A. Write similar equa- 
tions in the cases of generalized and damped generalized wave 
coordinates. 


16This preservation of the wave gauge 
property holds for Einstein equations 
with sources satisfying the conservation 
laws. 


17Recall that we use geometric units 
where Gg = 1; equivalently, in the case 
n+1=4, 8rGy = 1. 
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The homogeneous wave equations (4.5). imply F® = 0 if the initial 
data for g are such that the F®“ and their first derivatives vanish on the 
initial manifold M; that is, 


F°\u =0and OoF | = 0. (VIII.4.6) 


Corresponding properties hold for the generalized and damped wave 
gauges. 


VIII.4.1 Wave gauge constraints 


We will deduce from (4.6) relations that must hold between initial data 
on the submanifold z? = 0 for the solution of the reduced Einstein 
equations to be a solution of the full Einstein equations, that is, to be a 
metric in wave gauge. The simplest way to find these relations is to use 
the contravariant tensor densities 
G% := |Det g|? g°®. (VIIL.4.7) 

It holds that 

F° := g¥TS = —|Det g7 70,97. (VIII.4.8) 
Exercise VIII.4.2 Prove this identity. 
Hint: The derivatives of the determinant of g are Og(Detg) = 
(Det 9) 9 Oag9au- 


Straightforward computation using the identity (2.3) satisfied by the 
Ricci tensor shows that the Einstein tensor S' satisfies the identity 


1 a 1 a e Q 
So = RP zR” = Se | 5 (g AO, FP + g?*A, F% — g °0, F?) i 
(VIII.4.9) 


where the Sty are a quasidiagonal system of wave operators in the metric 
g for the Gor | 

Suppose that F|;=,0-,, = 0. Then also 0;F°|,0-,, = 0, and the 
identity (4.9) implies the equality 


1 
S| = Shyla + 59 IoF*)|u- (VIII.4.10) 


The expressions for saa and F® show that $°°|,, does not contain any 
second time derivative of g; that is, the equations 


Ce = 9, — Ty = 0 (VIII.4.11) 


are constraints on the initial data. If they are satisfied, then a solu- 
tion of the reduced Einstein equations with these initial data, i.e. such 
that Salm — T®| = 0, satisfies also Op F°|\z. We have proved the 
following theorem: 

Theorem VIII.4.1 A solution of the reduced Einstein equations with 
initial data such that F®”|m = 0 satisfies the full Einstein equations if 
and only if the initial data satisfy the constraints C% = 0. 


An analogous theorem holds for the other types of wave gauges. 
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VIII.5 Local existence and uniqueness 


An Einsteinian spacetime is constructed in local coordinates,!® in a 
neighbourhood of a hypersurface M with equation z? = t = to, from 
initial data gag|m, a Lorentzian metric for which M is spacelike, and 
(ogag) |m. 


Theorem VIII.5.1 (Fourès (Choquet)-Bruhat, 1952) 


1. The Cauchy problem for the vacuum Einstein equations with initial 
data on a spacelike hypersurface gag|m and (O:gag)|m satisfying the 
wave gauge constraints has a local-in-time solution, a Lorentzian met- 
ric in wave gauge depending continuously on the initial data. Its value 
at a given point depends only on the past of this point. 


2. The solution is locally geometrically unique. 


Part 1 of this theorem is a consequence of Theorems VIII.3.1 and 
VIIL.4.1. 

For Part 2, we first recall that the solution constructed in wave gauge 
from given initial data gag|m, (Oo9ag)|r satisfying the constraints is 
unique. To show local geometric uniqueness, we show that any given 
vacuum Ejinsteinian spacetime defined in a neighbourhood of M can be 
put in wave gauge in a possibly smaller neighbourhood of M, taking 
initial data that depend only on the original data. We will state a more 
general, global, uniqueness property after definining geometric initial 
data. 


VIII.6 Solution of the wave gauge 
constraints 


The expressions for S°° and F® show that the wave gauge constraints 
are of the form, for the Einstein equations with source the stress—energy 
tensor T, 


C= 59 FG — z9 agh FGPG + -T Z 0, (VIII.6.1) 


where K°° depends only on G and its first derivatives. 


Remark VIII.6.1 Giving G°?,0,G°° for t = to is equivalent to giving 
Jos, tJa, for t = to. 


The constraints appear as a system of n+1 equations for (n+2)(n+1) 
unknowns, G°?, 0,G%%, on the manifold z? = 0. This system, with more 
unknowns than equations, is undetermined. It is natural to try to split 
the initial data into specified quantities and n+1 unknowns, for which we 
wish to find a well-posed system of partial differential equations, which 
we expect to be elliptic because this is a pure, non-evolutionary, space 
problem. This was done for the first time in the general caset? by using 
the equations (6.1). Two choices were suggested for splitting the initial 
Cauchy data between unknowns and given data: 


18See the geometric intrinsic formula- 
tions in Section VIII.7. 


19Bruhat (Choquet-Bruhat) (1962). 
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20The principal symbol of a scalar par- 
tial differential operator is the poly- 
nomial obtained by replacing in the 
highest-order terms the partial deriva- 
tive 0; by the component £; of a vector. 
In the case of a system, the principal 
symbol is a matrix (see the example 
below). 


21A system of partial differential oper- 
ators is elliptic if its principal symbol 
is an isomorphism (i.e. an invertible 
square matrix) for each non-zero vec- 
tor €. For one scalar equation, this 
means that the principal symbol does 
not vanish for any non-zero set of real 
numbers €;. 


(1) Give arbitrarily on the initial manifold M the quantities G°’. The 


jG% are then known and the 0,G°° are determined by the har- 
monicity conditions. The remaining unknowns, in this first step, are 
u := G™ and v4 := 0G). Equations (6.1) can be written in the 
following form: 

e Fora =0, 


c° = 5g 03g” — A =0, (VIIL6.2) 


where 


1 , i 
A:= z9 0ng + G°A2G™ + K — 7° (VIII.6.3) 
depends only on the specified quantities and on the space deriva- 
tives of the unknowns of order at most 1 


e Fora=h, 


ch = 59 0:(0.9"") 4 Bh =0 (VIII.6.4) 


where 


lgo? gro at oO. gro 4 Ko — gao, (VIII.6.5) 
Equation (6.2) is a semilinear second-order equation for G°° with 
principal symbol”? the positive-definite quadratic form G” &&;; it 
is an elliptic?! equation. 

To write the system (6.4) of n equations with the n(n + 1)/2 
unknowns X” := 0,G" as an elliptic system for n unknowns 
and n(n — 1)/2 specified functions, one can use the Berger—Ebin 
decomposition theorem for a functional space E as 


E = ker D + range D*, 


where D and D* are a differential operator and its adjoint, one of 
which has injective symbol; the product DD* is then elliptic. Since 
in this section we work in local coordinates, we simply recall here 
the classical procedure for decomposing a symmetric unknown X“ 
on R”. We set 

X” =Y" 4 Z“, with &Y” =0 and ZË = UÍ + 8U’. 
Equations (6.4) take the form 


X BU + 8,0;U' = fi. 
They imply 
DF (0;U7) = la, 


which is an elliptic equation for 0;U? and an elliptic system for U/ 
when 0;U* is known. 
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(2) In the spirit of Wheeler’s ‘thin-sandwich conjecture’, we give ar- 
bitrarily on M the quantities Gf and ôG". The harmonicity 
conditions then determine ôG” = —0;G7" and imply the relation 


ag = -—a,g™. (VIII.6.6) 


The constraints now read, without needing the introduction of a 
further splitting, as a system of n+1 equations for the n+1 unknowns 
Gg”, g% : 


ch = 5g 0} g” — F" =0, (VIII.6.7) 


Weck . 
C= e — g0? g — F° = 0, (VIII.6.8) 


where F? and F” depend only on the given quantities and on the 
first space derivatives of the unknowns. 

The system is elliptic because of the expression of its characteristic 
determinant, the determinant of its principal symbol. 


Exercise VIII.6.1 Show that this determinant is proportional to a 
power of the determinant of a Laplace operator. 


Hint: The principal part in the equation C® = 0 contains only G. 


The solution of an elliptic system on a manifold is a global problem. 
Two cases are of particular interest in General Relativity: the case of 
asymptotically Euclidean manifolds and the case of compact manifolds. 


VIII.6.1 Asymptotically Euclidean manifolds 


A particularly physically relevant case is the study in Einsteinian gravi- 
tation of isolated systems, composed of a few bodies far from any other 
source of gravitation, so that the metric tends to be flat far away from 
the studied system. Such systems are modelled in general by asymp- 
totically Euclidean manifolds, defined in Chapter IV. For mathematical 
study of solutions of elliptic differential equations on asymptotically Eu- 
clidean manifolds, one uses either weighted Hélder spaces”? or weighted 
Sobolev spaces.?3 

The linearization of the nonlinear constraints (6.7) and (6.8) for given 
Euclidean data reduces to the Euclidean Laplace equations, for which 
solutions are known in the mentioned functional spaces. The nonlinear 
constraints have been shown to have a solution? for given data near 
Euclidean data. 

Various numerical methods are available for solution of the relevant 
equations. It would be interesting to solve them numerically, their solu- 
tion giving initial data directly for the harmonically reduced evolution 
equations, which form a well-posed system. 


?2Choquet-Bruhat (1974), Chaljub- 
Simon and Choquet-Bruhat (1979). 


?3Cantor (1979), Choquet-Bruhat and 
Christodoulou (1981). 


24Vaillant-Simon (1969). 


184 General Einsteinian spacetimes. The Cauchy problem 


25 Fourès (Choquet)-Bruhat (1956). 
26 Arnowitt et al. (1962). 


VIII.6.2 Compact manifolds 


Mathematicians are especially interested in the case of compact space 
manifolds without boundary, where topology plays an important role. 
Solutions of the constraints on compact manifolds have been exten- 
sively studied, but after writing them as elliptic equations for geometric 
unknowns—the induced metric gand extrinsic curvature K. 

Compact space manifolds are of interest as cosmological models. 
However, the choice of a model for our whole cosmos is very speculative. 


VIII.7 Geometric n + 1 splitting 


We will in the following sections formulate geometric, i.e. coordinate 
independent, theorems about the existence and uniqueness of solutions 
of the Cauchy problem. We first give the following decomposition,?° 
often called the ADM decomposition, although Arnowitt, Deser, and 
Misner?® are only accountable for its Hamiltonian interpretation. This 
decomposition has been for many years intensively used for numerical 
computation, in spite of the fact that the evolution equations that it 
gives are not a hyperbolic system. 


VIII.7.1 Adapted frame and coframe 


We consider a spacetime with manifold V = M x R and hyperbolic metric 
g such that the submanifolds M, = M x {t} are spacelike. We take a 
frame with space axes e; tangent to the space slice M, and time axis 
eo orthogonal to it. Such a frame, particularly adapted to the solution 
of the Cauchy problem, is called a Cauchy adapted frame. We take 
local coordinates adapted to the product structure, (x®) = (xt, x° = t), 
and we choose for e; the vectors 0/ Ox* of a natural frame on M;, i.e. 


0; = 0/dz'. (VIIL.7.1) 


The dual coframe is found to be such that, with 8 a time-dependent 
vector tangent to M+ called the shift, 


6° = dz? + dt, (VIII.7.2) 

while the 1-form 9° does not contain dx’. We choose 
9° = dt. (VIII.7.3) 

The vector eo, i.e. the Pfaffian derivative Oo, is then 
ð =O — 0;, with ô := 0/dt. (VIII.7.4) 


The vector eo is timelike since it is orthogonal to spacelike surfaces. We 
suppose that it defines the positive time orientation. In the coframe 0%, 
one has gio = 0, and the metric reads 


ds? = -N° (0°)? + gi;6°07, 0° =dt, 6° =dx'+ dt. (VIII.7.5) 


VUI.7 Geometric n + 1 splitting 


The function N is called the lapse. We shall assume throughout that 
N > 0. The time-dependent, properly Riemannian, space metric induced 
by g on M; is denoted by either g or g. An overbar denotes a spatial 
tensor or operator, i.e. a t-dependent tensor or operator on M. Note that 
in our frame, Jij = gij and g’ = g”. 

A spacetime (V, g) with V = M x R and metric (7.5) is called a sliced 
spacetime. 

The non-zero structure coefficients of a Cauchy adapted frame are 
found to be 


Ci; = -Cio = 956". (VIII.7.6) 


We denote by V the covariant derivative corresponding to the space 
metric g. Using the general formulas from Chapter I, we find that 


wi, =T}, = M, (VIILY.7) 


woo = Ngd,N, Whi = wip = NHAN, woo = NON, 
and 
N=? (Bogig + 9ngCi + GinCFo) » (VIII.7.8) 


from which we obtain 


w9, 


N? ogiz, (VIII.7.9) 


1 
j~9 
where the operator ðo is defined on any t-dependent space tensor T by 
the formula 


QO. .- 
bo = 5 — Le, (VIII.7.10) 


where Lg is the Lie derivative on M; with respect to the spatial vector £. 

Note that OoT is a t-dependent space tensor of the same type as T. 
The extrinsic curvature (also called the second fundamental 

form) of M; is the t-dependent symmetric space tensor K given by 


1 _ 
K; = -3N ogi = —whno = —Nwy,. (VII.7.11) 


The remaining connection coefficients are found to be (indices are raised 
with g) 


w =-NKj +36, who =-NK}. (VIIL.7.12) 


The trace in the metric g of the extrinsic curvature, often denoted 
by 7, is called the mean curvature of M; : 


T =trzK = 9G" Kij. (VIII.7.13) 
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K andr play an important role in the initial-value formulation and other 
geometric problems of General Relativity. 


Exercise VIII.7.1 Show that the extrinsic curvature K of M, is the 
projection on M; of the spacetime gradient of the past-oriented unit nor- 
mal n to Mi (some authors take the opposite orientation of n to define 
K). Show that T := Tr3K of K is equal to the spacetime divergence of n. 


Remark VIII.7.1 The tensor K is symmetric. It can equivalently be 
defined, up to a factor 1/2, as the Lie derivative of the spacetime metric 
in the direction of n. It does not depend on the value of n outside M. 


Remark VIII.7.2 A positive value of T signals a positive divergence 
of the past-directed normals. A negative value of T corresponds to con- 
vergence of the past-directed normals, and hence future expansion of the 
submanifolds M;. 


We deduce from the general formula giving the Riemann tensor and 
the splitting of the connection the following identities: 


Rg m = Rym + Kir Ki — Kuk k (VIII.7.14) 
Roigk = N(V Kui — VkKji), (VIII.7.15) 
Roi,oj = N(OoKij + NK. K*; + Vid;N) (VIII.7.16) 


From these formulas, we obtain the following expressions for the Ricci 
curvature: 


NR = NRij — Kij + NKij KP — 2NKi,K*; — Vi0j;N, (VIL.7.17) 
N`! Ro; = 0, KP — VK", (VIII.7.18) 
Roo = N(OoKf — NK; K” + AN). (VIII.7.19) 
Also, with R := g Rij, 
g Ri = R- NO KP + (KP)? — N'AN, (VIII.7.20) 
R=-—N~?* Roo +9" Rij = R+Kij K+ (KP)? -2N 10). KR-2N—'AN, 
(VIII.7.21) 


and 


1 1 = 
Soo = Roo 5 Jook = 5 (Roo + g” Hay): 
It follows that 


2N? Soo = -299 = R— Ky K” + (KP). (VIII.7.22) 
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VIII.7.2 Dynamical system with constraints 
for g and K 


We see in the above decomposition of the Ricci tensor that none of the 
components of the Einstein tensor contains the time derivatives of the 
lapse N or the shift 8. One is thus led to consider the Einstein equations 
as a dynamical system with constraints for the two fundamental tensors 
g and K of the space slices M;, while N and 8 are gauge variables.?7 


Constraints 

The expressions for the Ricci tensor lead to the following constraints 

for g and K on space slices M, for solutions of the Einstein equations:?% 

Hamiltonian constraint 

2N? So = —290 = R — K K“ + (KP)? =p, with p:= —2T®; 

(VIII.7.23) 

for the usual classical sources, p > 0. 

Momentum constraint 

N Soj = N7'Roj = Oj; Kp —-VnK"; = Jj, with J; = -N~ Toj. 
(VIII.7.24) 


These constraints coincide with those found previously by using the 
wave gauge and shown to be preserved under evolution for stress—energy 
tensor sources satisfying the conservation laws. 


Evolution 


The equations 


Ri = Ry - — 2K yn KP + Ky KP - x = pij, (VIIL7.25) 
together with the definition 
ðogij = —2N K;ij, (VIII.7.26) 


determine the derivatives transverse to M; of g and K when these tensors 
are known on M;, as well as the lapse N and shift 3 and the source pij. 
It seemed natural to look at these equations as evolution equations de- 
termining g and K, while N and 8, which are projections of the tangent 
vector on the timeline respectively on eo and on the tangent space to M, 
are considered as gauge variables. These evolution equations have been 
used extensively in the past for numerical computations. The Cauchy 
problem for these equations is clearly well posed for analytic data—a 
physically unsatisfactory condition because analytic functions have non- 
localizable support. The data of lapse and shift does not seem to be a 
good gauge choice. However, it can be proved?? that, given N and £8, the 
operator Rij given by (7.25), with Kj; given by (7.26), is a second-order 
differential system on the gij that is Leray—Ohya*’ hyperbolic and 


27Darmois (1927) was the first to view 
the Einstein equations (in a Gaussian 
gauge N = 1, 8 = 0) as an evolution 
system for g, K supplemented by initial 
constraints. 


?8These constraints correspond to the 
Gauss and Codazzi equations well 
known to geometers. 


29See YCB-OUP2009, Chapter VIII, 
Section 3. 


3°See J. Leray and Y. Ohya (1968). 
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31Gevrey classes are C% functions 
whose derivatives satisfy inequalities 
weaker than those satisfied by analytic 
functions and assure in this case the 
convergence of their expansion in Tay- 
lor series. In contrast to an analytic 
function, a Gevrey class is not deter- 
mined by the values of all its derivatives 
at a given point, nor by its value in an 
open set; it can have compact support. 


32Choquet-Bruhat and Ruggeri (1983) 
for zero shift; Choquet-Bruhat and 
York (1996) for arbitrary shift. 


33Called an algebraic gauge in the first 
formulation of Choquet-Bruhat and 
Ruggeri, but now called densitizing the 
lapse; it is equivalent to a wave gauge 
choice for the time variable. 


34Christodoulou and Klainerman 
(1989). 


35Choquet-Bruhat and Noutchegueme 
(1986) and, independently, Anderson 
and York (1999). 


36A system of N first-order partial 
differential equations in N unknown 
scalar functions of n + 1 variables with 
principal part 


J 
Ae oJ =1,...,N 
x 


is symmetric hyperbolic if the matrices 
A® with (real) elements A are sym- 
metric and the matrix A? is positive- 
definite. The Cauchy problem for such 
systems has one and only one so- 
lution, in relevant functional spaces 
(Friedrichs, 1954). See, for instance, 
YCB-OUP2009, Appendix IV. 
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causal. The Cauchy problem for such systems has solutions in Gevrey 
classes,*! causally determined from initial data. However, the infinite 
number of derivatives required from a Gevrey class foreshadow the poor 
stability of results of numerical computation, as has been confirmed by 
numerical analysis. 

More promising, appropriate combination of the evolution equation 
and the constraints have been shown to lead,*? modulo a gauge choice? 
for N, to a quasidiagonal system of wave equations for K that together, 
with (7.1), give an hyperbolic causal system for g and K. 

Another possibility, leading to a mixed elliptic-hyperbolic system, is 
to impose on N an elliptic equation that implies that the trace of K 
takes a given value.*4 


Preservation of constraints 


As shown before for the wave gauge constraints, the geometrically formu- 
lated constraints (7.23) and (7.24) are preserved under evolution when 
the stress—energy tensor source satisfies the conservation laws. A simple 
proof follows.®° 


Theorem VIII.7.1 [f Ri; — pij = 0 holds, then the constraints satisfy 
a linear homogeneous first-order symmetric hyperbolic system.” If they 
are satisfied initially, then they are satisfied for all time. 


Proof. If Ri; — pij = 0, then we have, in the Cauchy adapted frame, 
with p := g% pag, 


R-p= =N? (R — p0). 


Hence 


Sear = Aa p°) and R—p = —2N°(9% —T®) = 2(S)-TQ) 
and 


Sti — T" = 


ia Ea 
37” (R-— p) = —3” (S6 — To). 

With these identities, we may derive from the Bianchi identities and the 
conservation laws a linear homogeneous system for 44 = Sg — To and 


for X9 = S9 — Tf with principal parts 

N~?d9x% + g“ oxo, and E + 0:24. 
Since this system can be made symmetric hyperbolic, it has a unique 
solution, which is zero if the initial values are zero. E 


Exercise VIII.7.2 Show that Xj}, and E} satisfy a quasidiagonal 
homogeneous second-order system with principal part the wave operator 
of the spacetime metric g. 
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VIII.7.3 Geometric Cauchy problem. Regularity 
assumptions 


To give a geometric formulation of the solution of the Cauchy problem 
for the Einstein equations with data on a manifold M, we introduce the 
following definitions (in the usual case, n = 3). 


Definition VIII.7.1 


1. An initial data set is a triple (M,g, K) with (M,g) a Riemannian 
n-dimensional manifold and K a symmetric 2-tensor on M. 


2. An Einsteinian vacuum development of (M,g,K) is a Lorentzian 
(n+1)-manifold (V, g) solution of the vacuum Einstein equations such 
that M is an embedded submanifold of V and g is induced by g on 
M, while K is the extrinsic curvature of M in (V,g). 


The following existence theorem holds as a direct consequence of previ- 
ously stated and known results on hyperbolic differential equations. The 
weakest hypotheses on initial data using only classical Sobolev spaces 
and energy estimates are when n = 3, g € H}°°(M), K € H}°°(M). 
Recall that H!°°(M) is a space of tensor fields with components that, 
along with their derivatives of order less than or equal to s, are square- 
integrable in relatively compact domains of local coordinates covering 
M.°" The local uniqueness up to isometries is a consequence of the 
geometric uniqueness given in Theorem VIII.5.1. 


Theorem VIII.7.2 (existence and local uniqueness up to isometries) 
An initial data set (M,g,K) for the Einstein vacuum equations satis- 
fying the constraints admits a vacuum Einsteinian development (V, g). 
This development is locally unique, up to isometries. 


Remark VIII.7.3 The hypothesis g € H! when n = 3 is not suffi- 
cient to ensure the uniqueness of geodesics issuing from one point with 
a given tangent. This uniqueness would be ensured if the Christoffel 
symbols were Lipschitzian, which is not the case for Hy metrics on 
three-dimensional space. 


Lowering the regularity required of data is conceptually important for 
the understanding of the mathematics and the physics of the theory, 
and essential in the study of global problems. Geometric hyperbolic evo- 
lution systems involving the Riemann curvature tensor instead of wave 
gauges have been considered and have allowed for the broadening of ad- 
missible functional spaces for the initial data. Klainerman, Rodnianski, 
and Szeftel°® have conjectured that sufficient conditions for the existence 
of a Lorentzian metric solution of the vacuum Einstein equations in a 
four-dimensional neighbourhood of a 3-manifold M supporting initial 
data (g, K) are that Ricci(g) € L?, that VK € L? locally on M, and 
that the volume radius®® of (M,g) is strictly positive. In several long 
and difficult papers using refined functional analysis, they have proved 
their conjecture interpreted as a continuation argument for the Einstein 
equations; that is, the spacetime constructed by evolution from smooth 


37Various, more intrinsic, equivalent 
definitions can be found, for instance, 
in YCB-OUP2009, Appendix III, Sec- 
tion 3.7. 


38Klainerman, Rodnianski, and Szef- 
tel (2012). 


39The volume radius is the lower bound 
of the quotient by r? of the volume of 
geodesic balls of radius r in (M, g). 
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407 ichnerowicz (1944) for r = 0; York 
(1972) for T = constant. 


“1 Choquet-Bruhat (1971) for r = 0; 
York (1972) for T = constant. 


42The case of arbitrary n is treated in 
detail in YCB-OUP2009, Chapter VII, 
Section 3. 


data can be smoothly continued, together with a time foliation, as long 
as the curvature of the foliation and the first covariant derivatives of 
its second fundamental form remain L?-bounded on the leaves of the 
foliation. 


VIII.8 Solution of the constraints 
by the conformal method 


The conformal method, initiated by Lichnerowicz and developed by 
Choquet-Bruhat and York, has been intensively used to prove existence 
of solutions of the constraints. Variants of it are now used in numer- 
ical computations of initial data for the two-body problem. It has the 
advantage of turning the momentum constraint into a linear system in- 
dependent of the conformal factor for a weighted extrinsic curvature 
when the mean extrinsic curvature T is zero or constant.4° The con- 
formal factor then satisfies a semilinear elliptic operator with principal 
part the Laplace operator of the conformal metric. Given a TT (trans- 
verse traceless) tensor, the momentum constraint can be written as a 
linear elliptic system for a vector field.44 The method was used in early 
numerical computations, essentially by taking conformally flat metrics. 


VIII.8.1 Conformally formulated (CF) constraints 


When the sources p and J are known, the unknowns in the constraints 
(7.24) and (7.23) are the metric g and the tensor K. It is mathematic- 
ally clear that these scalar and vector equations do not have a unique 
solution, even geometrically. Physically, this fact corresponds to the 
property of Einsteinian gravity that it is its own source. Roughly speak- 
ing, ‘radiation data’ should also be given, the constraints then becoming 
an elliptic system for a scalar function and a vector field. The geometric 
meaning of the lower-order terms permits discussion of the existence and 
uniqueness of solutions on general manifolds. We treat for simplicity of 
notation the usual physical case n = 3.47 
The Hamiltonian constraint (7.23) reads, for general n, 


R- (|K +p-7°)=0, R:= RG), 7 := try K. (VIIL8.1) 


This equation is turned into an elliptic equation for a scalar function y 
by considering the metric g as given up to a conformal factor. In the 
case n = 3, one sets 


Jij = P ijs 
with y a given metric on M and y a function to be determined. 


The scalar curvatures of the conformal metrics g and y are found by 
straightforward computation to be linked when n = 3 by the formula 


RG) = 9° [pR(7) — 8474] . (VIII.8.2) 


VIIIS Solution of the constraints by the conformal method 


Exercise VIII.8.1 Prove an analogous formula for general n > 3 with 
4/(n—2) 
p : 


Hint: The choice y* when n = 3 is made to eliminate quadratic terms 
in first derivatives of y. 


The identity (8.2) implies the following theorem: 


Theorem VIII.8.1 When y, K, and the source p are known, the Ham- 
utonian constraint is a semilinear second-order elliptic equation for y, 
linear also in first derivatives, which, in the case n = 3, reads 


8A, — R(y)y + (K|? — 7? + p)y® = 0. (VIIL.8.3) 


To solve the momentum constraint, we split the unknown K into 
a weighted traceless part K and its trace 7. In the case n = 3, we set 


254 1_ 
Keg =p “Kij + 3 GigT- (VIII.8.4) 


Equivalently, with indices in K and K being respectively lifted with g% 
and 7, 


T: es ll M m 
KY = yK" + 397 gi =p ty” . (VIII.8.5) 
The tensor K is symmetric and traceless; indeed, 
> Seas af 1 
try k= y“ Kij = yp 2g (xa = Zaur) = (); 


Straightforward computation shows that the momentum constraint 
becomes, with D the covariant derivative in the given metric y, 


Sune =D > 
D,KY = ge y Or +o Ty. (VIII.8.6) 


This equation has the interesting property that it does not contain y 
when J is zero and 7 is constant on M, that is, when M is be a subman- 
ifold with constant mean extrinsic curvature in the ambient spacetime. 
We have the following theorem: 


Theorem VIII.8.2 In the case T = constant and J = 0, the symmet- 
ric 2-tensor K“ := p-!°K" is a TT tensor (transverse, traceless) and 
the momentum constraint reduces to linear homogeneous sytem for K: 


D;K4=0, yK” =0. (VIIL.8.7) 


Exercise VIII.8.2 Show that the space of TT tensors is the same for 
two conformal metrics. 


Returning to the Hamiltonian constraint, we compute, using the 
definition of K, 


ae 
JK |g = 9*9" Ku Ku = po PIK + gt. (VIIL8.8) 


191 


192 


431t was obtained by Lichnerowicz 
(1944) for n = 3 and extended to 
general n by Choquet-Bruhat (1996). 
Scaling of the sources was introduced 
by York (1972). We will still refer to 
as Lichnerowicz equations all the equa- 
tions deduced from the Hamiltonian 
constraint by the conformal method. 


44Vork (1972). 


45Choquet-Bruhat (197la) and York 
(1972); Fischer and Marsden (1979), 
Choquet-Bruhat and York (1980), 
Choquet-Bruhat, Isenberg and York 
(2000) and see further references in 
Choquet-Bruhat and York (2002). For 
the coupling with a scalar field see 
Choquet-Bruhat, Isenberg and Pollack 
(2007). 
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Exercise VIII.8.3 Prove this formula. 
Hints: K is traceless and yak ak = one = 3: 


Exercise VIII.8.4 Write down the corresponding formula for arbi- 
trary n. 


The Hamiltonian constraint now reads 


ih oe 2 
8A, — Rye + o "| KIZ + (-37 + p) p =0. (VIIL.8.9) 
It is a semilinear elliptic equation for p when y (a Riemannian metric), 
T, p, and K are known. It is called the Lichnerowicz equation.** 
A scaling of the momentum of the sources by setting, when n = 3,44 


Japs, 


with J considered as a known quantity, permits the extension of the de- 
coupling property to the non-vacuum case. The momentum constraints 
now read 


DŘ = Ï. 


The scalar part p of the sources can also be scaled. The scaling proposed 
by York, justified by physical considerations (at least in the case n = 3 
for electromagnetic field sources), is 


p= "ĵ, 
with p considered as a given function. The Hamiltonian constraint then 
reads 


er oer me. 
8A,y — R(y)¢ + |K pT + pe? — 37 =0. (VIII.8.10) 


VIIIL.8.2 Elliptic system 


We complete the writing of the CF constraints as an elliptic partial 
differential equation system on the initial manifold by a treatment of the 
momentum constraint analogous to that indicated in Section VIII.6. 
We can split the tensor K into an element of the kernel of the homogen- 
eous part of the momentum constraint operator, i.e. a TT tensor Y for 
the given metric y, and an element of the range of the dual operator, 
the conformal Lie derivative Z of a vector field X; that is, we set 


Ka =Y; + Zi, with DY =0, (VIII.8.11) 
and, when n = 3, 
2 
Zij = (Ly cont X)ij = DiX; + D; X: zw DX". (VIII.8.12) 


A tensor of this form satisfies the momentum constraint if and only if 
the vector X satisfies the second-order system 


Ay cont X := D+ (Ly cont X) = D- J. 
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Exercise VIII.8.5 Show that this system is a linear elliptic system 
for X when y and J are known. 


The search for the arbitrary TT tensor Y;; can itself be done through 
the data of an arbitrary traceless tensor Ui; by setting Y := Ly conf V +U 
and imposing the requirement that the vector V satisfy the following 
elliptic system: 


Ay contV =D. (Ly,contV) =-D.-U. 


The traceless tensor Y is then also transverse, i.e. satisfies D- Y = 0. 
The arbitrary data in the extrinsic curvature K is in this scheme 
the symmetric traceless tensor U*). The physical extrinsic curvature is 
given by 


Ki = p”? [Lyco X +V)” + u"| T 39 T. 


The splitting of the solution K into a given traceless tensor U and the 
conformal Lie derivative of an unknown vector X depends on the choice 
of y, not only on its conformal class.*6 


Exercise VIII.8.6 Prove this statement. 


VIII.8.3 Physical comment 


The conformal method is a mathematical, geometric convenience. The 
tilde (~) quantities are not directly observable. There is a large arbi- 
trariness in the choice of their scaling, although some justifications are 
given a posteriori, at least for electromagnetic field sources. The tilde 
quantities play the role of parameters to construct initial data solutions 
of the constraints that can, hopefully, be used for evolution and lead to 
results that can be confronted with observations. 


VIIIL.9 Motion of a system of compact 
bodies 


There is now a wealth of new results from observations of motions of 
stars and galaxies obtained by powerful Earth-based and satellite tele- 
scopes. Also, a new generation of gravitational wave detectors (LIGO, 
GEO, TAMA, and VIRGO) are now operational, and it is hoped that 
within the next few years they will reach sensitivities that will allow for 
the first time observations of gravitational radiation. The prime targets 
of these observations are the motions of compact binaries: black holes 
and neutron stars. A fundamental problem for Einsteinian gravitation 
theory is the modelling of the motion and the gravitational wave emis- 
sion of systems of compact bodies. For many years, considerable effort 
has been spent towards this goal, with little success until fairly recently, 
because of the nonlinearity of the equations, their hyperbolic-elliptic 


46More refined splittings have been 
given by York under the name of con- 
formal thin sandwich. 
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47The problem becomes arduous when 
more than two bodies are involved. 


48For recent comparisons, see, for in- 
stance, Damour, Nagar, and Bernuzzi 
(2013) and Hinder et al. (2014). 


49See Damour and Nagar (2011). 


50See Pretorius (2007) and references 
therein. 


51 Baumgarte-Shapiro-Shibata— 
Nakamura. 


52See, for instance, Rezzolla and Zan- 
otti (2013). 


character, and the insufficient stability of the solution of the chosen evo- 
lution equations, in particular the so-called ADM ones for g and K. In 
Newtonian theory, two bodies represented by pointlike masses describe 
conics with focus situated at the centre of gravity.*” In Einsteinian gravi- 
tation, the dynamics of two-body systems is already too complex to be 
solved exactly, and prediction of the gravitational radiation they will 
emit in their inspiralling and merger can be obtained only by refined 
approximation methods (analytical or numerical) after a clever choice 
of unknowns and gauges, and splitting of equations between constraints 
and evolution. The mathematical well-posedness of the considered sys- 
tem is of course essential for the stability and reliability of the results. 
Refined analytical results have been obtained in the last decade both 
by the high-order post-Newtonian approach and by using the effective 
one-body (EOB) method. In parallel, thanks to the advent of extremely 
powerful computers, accurate numerical simulations of two-body systems 
have been obtained using various numerical codes. The results of the two 
types of methods (analytical and numerical) have been compared and 
found to be in remarkable agreement.*® 


VIII.9.1 Effective one-body (EOB) method 


The EOB formalism*® is an analytical approach that replaces the usual 
post-Newtonian expanded approximations to the motion and radiation 
of binary systems by resummed expressions. This formulation allows one 
to describe the dynamics of binary black hole systems up to the moment 
of the coalescence of the two black holes, and is further completed by 
the description of the final ringdown gravitational wave signal emitted 
by the distorted black hole formed during the coalescence process. This 
formalism thereby provides a quasi-analytical description of the entire 
waveform emitted by a binary black hole system from early inspiralling 
up to the final ringdown. It has also been extended to the description of 
the coalescence of binary neutron star systems. 


VIII.9.2 Numerical Relativity 


Although its origins lie in the 1960s, it is in the years since 2005 that 
Numerical Relativity has seen considerable expansion, owing to the 
enormous growth in the power of computers and to the development 
of stable codes describing the coalescence of compact binary systems. 
This has permitted the modelling of more realistic situations than be- 
fore, although difficulties remain. Two main codes are now in use: one 
is based on damped generalized wave coordinates,°? while the other, 
called BSSN,°! is a modification of the 3+1 decomposition using con- 
formal weights. The code BSSNz* improves the stability of results by 
introducing four more unknowns linked with the harmonicity functions. 

Numerical Relativity is a science in itself, outside of the scope of this 
book.°? 


VIII.10 Global properties 


VIII.10.1 Global hyperbolicity and global uniqueness 


A fundamental notion in the study of global properties of solutions of 
hyperbolic partial differential equations is that of global hyperbolic- 
ity, defined by Jean Leray”? for general hyperbolic differential equations 
as compactness™ (in a functional space) of paths joining two points 
whose tangent belong to the cone determining the domain of depend- 
ence of solutions of the Cauchy problem; that is, causal paths in the case 
of relativistic causal systems. Global hyperbolicity forbids, in particu- 
lar, the existence of closed causal curves.” Later, Penrose®® introduced 
what he called strong causality: a Lorentzian manifold (V,g) is said 
to be strongly causal if any neighbourhood of any point x contains a 
smaller neighbourhood such that no causal curve penetrates in it more 
than once. It can be proved” that global hyperbolicity is equivalent to 
strong causality together with compactness of the subsets of V that are 
intersections of the past of any point x with the future of any other point 
y, traditionally denoted by I- (æ)  I(y). 

A very useful definition due to R. Geroch is that of a Cauchy hy- 
persurface in a Lorentzian manifold (V,g), that is, a submanifold S$ 
of codimension 1 intersected once and only once by each inextendible 
timelike curve. Geroch®® proved that the existence of a Cauchy sur- 
face is equivalent to global hyperbolicity. He also proved the important 
property that the support of the spacetime is then a product S x R. 

With these definitions, one can complete the existence theorem by a 
global uniqueness theorem proved by Choquet-Bruhat and Geroch;°? it 
uses in particular Zorn’s lemma. Proofs without the use of this lemma 
have been published recently. 


Theorem VIII.10.1 (existence and geometric global uniqueness) A 
vacuum Linsteinian development (V,g) of an initial data set (M,g, K), 
satisfying the vacuum Einstein constraints, exists and is unique (up to 
an isometry) in the class of maximal® globally hyperbolic spacetimes. 
The manifold M embedded in (V,g) is a Cauchy hypersurface. 


VIII.10.2 Global existence 


A solution of the vacuum Einstein equations is called global if it is a 
complete Lorentzian manifold. It is generally called singular if it is in- 
complete. Incompleteness can result from the appearance of a curvature 
singularity (with the Kretschmann scalar® tending to infinity on a 
causal curve for a finite value of the canonical parameter) or another 
phenomenon.®* Proving either global existence of solutions or, on the 
contrary, their incompleteness and the formation of singularities, are 
difficult problems and the subject of active research. Some remarkable 
achievements linked with the special properties of the Einstein equa- 
tions, fundamental in physics but quite particular nonlinear geometric 
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53Leray (1953). The notion was applied 
and described in detail for Lorentzian 
manifolds by Choquet-Bruhat (1968). 


54The empty set is considered as com- 
pact. 


55A path is a mapping from an inter- 
val of R into the manifold; a curve is 
the image of a path in the manifold. 
A closed causal curve can be covered 
by an infinite sequence of causal paths 
joining two points, a non-compact set. 


56See, for instance, Penrose (1968). 
57See Choquet-Bruhat (1968). 


58Geroch (1970). For a detailed proof, 
see, for instance, Ringström (2009). 


59Choquet-Bruhat and Geroch (1969). 


60See Ringström (2009), corrected by 
Ringström in <http://www.math.kth. 
se/~hansr/mghd.pdf> and in Sbierski 
(2013). 


61 Which cannot be embedded in a big- 
ger one. 


62The square of the Riemann tensor. 


63See Problem IV.11.4 on Taub space- 
time. 
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64 Friedrich (1986). 


65This formalism uses on a four- 
dimensional manifolds Weyl spinors 
represented by objects with 2-index 
components. 

For a tensorial formulation of 
the Friedrich conformal system, see 
Choquet-Bruhat and Novello (1987), 
CB-DM II, Chapter V, Section 7. 


66See Christodoulou and Klainerman 
(1993). See also Klainerman and Nicolo 
(2003) and Bieri (2007). 


67 This theorem showed that Einstein’s 
intuition that the only global asymp- 
totically Euclidean vacuum solution of 
his equations is Minkowski spacetime 
required one more hypothesis to be 
true, namely a faster fall-off of the met- 
ric at spatial infinity, which implies the 
vanishing of the ADM mass. 


68 Their proof has been notably simpli- 
fied by Lydia Bieri. For the use of equa- 
tions satisfied by the Riemann tensor, 
see Problem [.14.5 in Chapter I. 


69%Lindblad and Rodnianski (2005). 
70Rodnianski and Speck (2009). 


See articles and references in 
Chruściel and Friedrich (2004). 


T2See Hawking and Ellis (1973). 


“3Sugeested in private discussions by 
Geroch and Penrose in 1969, and 
formalized by Eardley and Mon- 
crief (1981). 
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differential equations, have been obtained in the last twenty years with 
the use of advanced and intricate mathematics. The details of these 
achievements are outside of the scope of this book (in fact of any book 
of reasonable size!). We will only give some definitions and quote some 
results with references to original papers. 

A future global existence theorem for data near Minkowskian data 
on a manifold tending to a null manifold at infinity was given by H. 
Friedrich® using an original conformal construction and the Newman- 
Penrose formalism.®° 

A breakthrough in the problem of global existence for initial data 
on a spacelike manifold was the proof in 1989 by Christodoulou and 
Klainerman®® of the global nonlinear stability of Minkowski spacetime— 
that is, the construction of complete globally hyperbolic solutions of the 
vacuum Einstein equations with asymptotically Euclidean initial data 
near data for the Minkowski spacetime.” They used in particular a 
double null foliation and delicate estimates of the Riemann tensor.°® 
The proof of the global nonlinear stability of Minkowski spacetime was 
obtained later by Lindblad and Rodnianski®? through wave coordinates 
and the use of special properties (a kind of generalized null condition) 
of the Einstein equations. The global existence result has been extended 
to cases of Einstein equations with field sources, and even to the case or 
irrotational fluid sources.” 

Proofs of global existence have been obtained for small initial data on 
several categories of compact manifolds with symmetries by: Moncrief, 
by Choquet-Bruhat and Moncrief, and by Choquet-Bruhat, and, in a 
case without symmetries, by Andersson and Moncrief.”1 


VIII.11 Singularities and cosmic 


censorship conjectures 


The problem of the formation of singularities from generic initial data 
was attacked by Penrose and Hawking in the 1970s.’? Inspired by the 
Schwarzschild solution, they discussed what they called the strong and 
the weak cosmic censorship conjectures. These conjectures con- 
cern generic Einsteinian spacetimes with physically reasonable sources. 
A generic spacetime can be understood as a spacetime with no iso- 
metry group, or as a spacetime that is stable (in some sense to be 
defined), under small perturbations. Reasonable sources are physical 
sources that have a hyperbolic, causal evolution and do not have their 
own singularities (shocks, shell crossings, etc.). 


VIII.11.1 Strong cosmic censorship conjecture 


The strong cosmic censorship conjecture’? aims at proving the deter- 
ministic character of General Relativity at the classical (non-quantum) 
level. It can be formulated as follows: 


VHT. 11 


Strong cosmic censorship conjecture The maximal globally hyper- 
bolic vacuum Einsteinian development of generic initial data is 
inextendible as a vacuum, even non-globally hyperbolic, Einsteinian 
spacetime.” 


The inextendibility is sometimes conjectured to hold for all devel- 
opments, not necessarily solutions of the Einstein equations—this is 
certainly the case when incompleteness is due to the appearance of 
infinite curvature. 

Examples of non-globally hyperbolic extensions of a globally hyper- 
bolic spacetime with initial data on S&S? are provided by the 
Taub-NUT spacetimes.” However, these extensions do not provide a 
counterexample to the strong cosmic censorship conjecture, because the 
Taub spacetime, because of its symmetries, is not generic. V. Moncrief 
and J. Isenberg have shown that some qualitative features of Taub-NUT 
spacetime imply in fact the existence of an isometry group. 


VITI.11.2 Weak cosmic censorship conjecture 


The original idea of Penrose came from the study of spherical gravi- 
tational collapse where a black hole forms, hiding the singularity to 
timelike observers. The weak cosmic censorship conjecture” is that 
in generic Einsteinian spacetimes with physically reasonable sources, it 
is not possible for there to form any naked singularity, that is, a sin- 
gularity visible by an observer; in other words, the past of no point x 
contains a future causal curve that is inextendible. 

Note that the big bang is not a counterexample to this conjecture—it 
has no past, and hence does not correspond to any future inextendible 
causal curve. Nor is the Schwarzschild metric with m < 0 on the mani- 
fold (R? — {0}) x R a counterexample to the ‘non-naked singularity’ 
conjecture, because it is not considered to be a physically meaningful 
metric. 

A more elaborate version of the weak cosmic censorship conjecture 
uses the Penrose definition of conformal null infinity—it raises technical 
difficulties regarding the existence and smoothness of the boundary of a 
Penrose diagram for general spacetimes. 

Christodoulou has obtained in a series of papers complete results sup- 
porting the weak cosmic censorship conjecture for the Einstein-scalar 
equations with spherical symmetry. 

An important theorem due to Penrose links the existence of sin- 
gularities with trapped surfaces.” In a four-dimensional spacetime, a 
trapped surface S is a two-dimensional compact spacelike surface 
without boundary (typically a sphere) such that the two families of 
future-directed null geodesics orthogonal to S have a positive conver- 
gence whether they are directed inwards or outwards. Penrose’s theorem 
says that in a non-compact spacetime, a trapped surface always hides 
an incompleteness of the spacetime. 


Singularities and cosmic censorship conjectures 
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T4Loosely speaking ‘generic’ means 
‘without special properties’. In a more 
precise mathematical sense, it can be 
interpreted that ‘generic initial data’ 
form a dense subset of the set of pos- 
sible initial data in some relevant top- 
ology, see for instance discussion in 
Isenberg and Moncrief (2002). 


The formulation leaves open the 
smoothness required of extendibility 
and also the hypotheses on initial data. 
It was pointed out by P. Chrusciel that 
the requirement that the initial mani- 
fold (M, g) be complete is not sufficient 
to make the conjecture plausible— 
initial data for Minkowski spacetime 
on a hyperboloid are then a trivial 
counterexample. 


76See Problem IV.11.4 in Chapter IV. 


77The weak and strong cosmic conjec- 
tures are independent. 


T8 Penrose (1965). 
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“9Christodoulou (2009). 


80Klainerman, Luk, and Rodnian- 
ski (2013). 


In a 500-page book”? Christodoulou proves the formation of 
trapped surfaces due to ‘short-pulse’ data on a characteristic cone. 
Christodoulou’s®? construction has recently been extended by Klain- 
erman, Luk, and Rodnianski to data on a null geodesic segment and his 
proof has been simplified. 


VIII.12 Problems 


VIII.12.1 Symmetric hyperbolic systems 


The system of N first-order linear partial differential equations on 
R” x R, 


Ur at i = 0, 

(VIII.12.1) 
with u = (uy, I = 1,..., N) a set of N unknown functions, f = (ft, I = 
1,..., N) a set of given functions, and M® and A given N x N matrices, 
is called symmetric hyperbolic (SH) if the matrices M® are sym- 
metric (M®17 = M®71) and the quadratic form defined by the matrix 
M?, the coefficient of 0/0x°, is positive-definite; that is, 


wel —ut+Au+f=0, ie. MeV 
Ox® Ox” 


M? (u,u) = M uruz>0 for all u #0. 
1. Define a function called the energy of u at time z? = t by the 
integral 


-;/ M (u,u)d’z, desde ..,.d2", (VIII.12.2) 
2°=t 


Show that under appropriate smoothness and boundary conditions, 
E(u) satisfies the energy equality 


PET E +f A ee mor) Ki 


+A ujur + Puy dx dt. 
(VIII.12.3) 


2. Show that the energy equality implies an integral energy inequal- 
ity of the form 


T 2 
Caos / [Ci(t) Be(u) + C2(t) Be (w)] dt. (VIIL12.4) 


to 


Deduce from it a bound of the energy. 


Solution 


1. It holds that 


and hence, if the matrices M® are symmetric (ie. M077 = M%/1), 
then 


o 1 o 
uzM®17 T ur = gM T (urus) 
Le a lJ o a IJ 
= 5r (uruy M=”) — pull 5 aM 


Under appropriate smoothness and fall-off conditions at infinity, 
the integral on R” of (0/0z")(uzuyM*7) vanishes owing to Stokes’s 
formula. There remains the equality 


i 1 ð alt ð a, IJ 
k Jaata [oat a] 


+ Al urug + Pu) dz” dz? = 0, 


which gives the energy equality after performing in the first term the 


integrations first with respect to x” and then over 2°. 


2. Assume that M° is uniformly positive-definite, i.e. 
MoM urus > Colul?, 


while (0/0x%)M°1/ and A!” are uniformly bounded and f” is 
square-integrable for each t. It is straightforward to deduce an inte- 
gral energy inequality of the given form from the energy equality using 
elementary properties of integrals and the Cauchy—Schwarz inequality 


[wae < (S. Para) “3 TOR 


To prove the energy inequality, one uses a general theorem! for dif- 
ferential equations that says that if f(t, y) is a function continuous in t 
and Lipschitzian in y, then the differential equation 


z= f(t, z) 


has one and only one maximal solution taking a given initial value. Any 
C1 function y satisfying the inequalities 


y’ < f(t, y), y(to) < 2(to) 


satisfies y(to) < z(to). 
This theorem implies that if y(t) satisfies the integral inequality 


Co > 0, a constant, 


E È 
WT) < olto) + f [Crylt) + Cauley?) at, 


to 


(VIII.12.5) 
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81See the proof for instance in Cho- 
quet (2006). 
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82A result sometimes called Grénwall’s 
lemma. 


with Cı and Co non-negative, then y(t) is less than or equal to the 
function z satisfying the corresponding equality®? and such that z(t) > 


y(to). 
In the considered case, y(t) := E(u), one finds that z is the solution 
of an equation of the form 


gZ =C + C227, 
i.e. setting z = Ç?, the linear equation 


2¢' = Cı + Co, 


1 
which has a solution with initial data Ço := yg continuous and bounded 
for all finite t. Writing down this solution is left as an exercise for the 
reader. 


VIII.12.2 The wave equation as a symmetric 
hyperbolic system 


1. The wave equation with mass a constant m for a scalar function u on 
a Lorentzian manifold (V, g) is 


g° PV gdgu —mu+f=0. 


For g = 7, the Minkowski metric, write this as a symmetric hyperbolic 
system. 

For m a constant, compare the ‘mathematical energy’ of u defined 
by (12.2) with the physical energy as it would be defined from the 
wave equation by multiplying it with Ou/dt. 


2. Extend the study to the case when (V,g) is a sliced Lorentzian 
manifold. 


Solution 


1. Recall our signature (—,++...+). The equation reads 


—n?V Ogu +mu= dou — 2 3u +mu=f, Og:= Ja 


Set ua = O,u and consider the system M°0,U with unknowns U; = 
(u; Ua): 


ou = uo, oui — juo = 0, ouo — X dius +mu= f 


VIII. 12 


The principal operator M°0, is the matrix 


do 
Oo —O1 


—-O; -0, —-03 o 
The matrices M® are obviously symmetric and M°, the unit matrix, 
is positive-definite. 


Multiplication of the wave equation by Ou/Ot gives by simple 
computation the identity, if m is a constant: 


Ou | 0?u 3u 
at E > Gap Oe sl 


18 |/3u\’? ðu \? 3 ð (du du du 
“35 (3) (Gs) -m| DE) 


a i 


Hence, by integration when f = 0, using Stokes’s formula, 


1 du\? du \? 2 i 7 ; 
J OE 5 Jri H mu’ | dx” ...dx” = constant if f = 0. 
M; 


a 


The integrand is interpreted as the physical energy density at time t, 
positive if m > 0 and zero for m = 0 only if u = constant. 


. For a non-flat Lorentzian metric on a sliced manifold, we take an 
orthonormal frame adapted to the slicing, 

9 = Nap OO", 
with 


0° = Ndt, Ø =a) dx!+p' dt, hence 0 = oe 


ð ; 
Ori’ ð; ‘ 


O 
Əxi” 
The wave operator reads 

9° PV aðu = NP (3aðpu + Yag0nu). 
Introducing the new unknowns ua = O,u and the identities 


Q 
oui = O; Uo = Ciota 


gives for the unknowns U = (u, ua) a linear system with operator in 
matrix form 


M°O,U + AU. 


The matrices M* are the same as in the Minkowski case, and hence 
symmetric, and the matrix coefficient of 0/0t in M? is the Minkowski 
M? metric. The same reasoning applies as previously. 
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83See Choquet-Bruhat and Moncrief 
(2001) and Choquet-Bruhat (2004). 


84Moncrief (1986). 


VIII.12.3 The evolution set of Maxwell equations 
as a first-order symmetric hyperbolic system 


The evolution set of Maxwell equations on Minkowski spacetime M4, 
with unknowns (E, H), comprises (see Chapter IT) 


ðE! ƏH? ƏH? | ƏH! dE? OB? 
= i a at | (VE) 


together with the equations obtained by circular permutation of the 
indices 1, 2, 3. The characteristic matrix M°0, is 


Oo a O2 03 
Oo —ð Og 0 
Oo 03 -—O; 
—02  —03 Op 
—02 oi Oo 
03 0 —-d, Oo 


It is symmetric, and M°, the unit matrix, is positive-definite. Note 
that in this case the energy densities corresponding to symmetric 
hyperbolicity and to physics coincide, both being equal to $(|E|?+|H]|?). 


VIII.12.4 Conformal transformation of the CF 
constraints 


Show that two different choices of the given metrics y and 7 lead 
to equivalent conformally formulated constraints if the sources are 
appropriately chosen. 


VIII.12.5 Einstein equations in dimension 2 + 1 


The vacuum Einstein equations are trivial in the case n = 2 in the 
sense that Ricc(g) = 0 implies that the spacetime metric g is locally flat 
when n + 1 = 3. However, the (2 + 1)-dimensional Einstein theory has 
a topological content: (V, g) is not necessarily the Minkowski spacetime 
M3. In particular, V = S x R with S a two-dimensional compact surface 
can be a Lorentzian flat manifold with S the torus T? or a surface 
of genus greater than 1. On the other hand, (2 + 1)-dimensional, non- 
flat, Einstein equations with sources appear for spacetimes that admit 
a one-parameter spacelike isometry group.*? 

Write a conformal formulation of the constraints in the case n = 2 by 
setting®* 


: _ 1. 
g =exp(2A)y and K” = exp(4A) KY + ZIT 


Solution 


Elementary computation gives 
are . za T a 
Vi KY — g” air = exp(4A)DiK” — a exp(—2A)0;T. 


The constraints split again into a linear system for K and a semilin- 
ear equation for A when K is known, if 7 is a given constant and the 
momentum of the sources is zero, or properly weighted. 


VIII.12.6 Electrovac Einsteinian spacetimes, 
constraints 


The Einstein equations with electromagnetic source are (see Chapter IV) 
Sop = Tap = Fo Fay — T gag F™ Fy, (VIII.12.7) 
where the electromagnetic 2-form satisfies the Maxwell equations 
dF=0, ie VoaFg,+ VyFag + Vey = 9; (VIII.12.8) 
and, in vacuo, 
6F =0, ie. Vor? =0. (VIII.12.9) 


The electromagnetic initial data on a hypersurface M are a 2-form F 
and a vector field E. The 2-form F is the form induced on M by the 
electromagnetic field F, while E is the electric vector field on M relative 
to the unit normal n to M in the spacetime metric. 


1. Show that in a Cauchy adapted frame, the components of E are 
E' := NF", with N the lapse of the metric g.  (VIII.12.10) 


2. Show that a solution of the Maxwell equations must satisfy on M the 
constraints 


dF =0, ie Vati + ViFhi + Viljn = 0, (VIII.12.11) 
and 


ViE' =0. (VIII.12.12) 


Solution 
1. EÊ Sng? na = NS i.e E? =0, EŻ = NF“. 
2. dF = 0 and 6F = 0 on M imply 
OnF i; + Oj Fi + OF jn = Va Fij + VjFhi + ViFin =0 


and 


— 1 
V, fe = €7fe. ps Fò = 
ai ar VDet g 


yg; EF + F°0;./9 = 0, 
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85It seems that this gauge, although 
used by Lorentz, was first introduced 
by another mathematician named Lor- 
enz (without a ‘t’). 


that is, using Det g = N? det g, 
0;(N\/Det gF”) = VE’ =0. (VIII.12.13) 


VIII.12.7 Electrovac Einsteinian spacetimes, 
Lorenz gauge 


The vector potential A is a locally defined 1-form such that 
F=dA, ie. Fag = 0,Ag — OgAa- (VIII.12.14) 


The first Maxwell equation dF = 0 is automatically satisfied. The 
potential A is said to be in Lorenz" gauge if it has zero divergence: 


VA? =0, (VIII.12.15) 


Show that, in vacuo, the second Maxwell equation reduces in wave 
coordinates and Lorentz gauge to a quasidiagonal semilinear system of 
wave equations for the components of A of the form 


9° 0,0aAg = folg, 8g, OA). (VIII.12.16) 


State a local-in-time existence and uniqueness theorem for a solution of 
the Cauchy problem for the Einstein—Maxwell system 


Solution 


V° Fag = 9° (Fag = Ty pho — Maus) E 0. 


In wave coordinates, it holds that TS = 0, and the above system 
reads, in terms of A, 


9° [an3a Ap — On9pAa —Thg(IaAu — Op Aa)| = 0. 
By elementary manipulations, these equations become 


g°^3 ða AB — 3a (g^ Aa) + (3gg ^) ða Aa — T$glÂaAp — 0,Aq) = 0. 

(VIII.12.17) 
If A satisfies the gauge condition g°*0,Aq = 0, they reduce to a qua- 
sidiagonal semilinear system of wave equations for the components of A, 
of the form 


9° 0)0aAg = folg, 8g, OA). (VIII.12.18) 


The equation g°*0,Aq = 0 is equivalent in wave coordinates, where 
pore , = 0, to the Lorenz gauge condition, because 


VAA = 9V y Aa = 9 (Oy Aa — TË Ay). 


Equations (12.18) together with the Einstein equations in wave gauge, 
with source the Maxwell tensor of F, constitute a quasidiagonal quasi- 
linear system of wave equations for the pair g, A. Local existence and 


uniqueness for a solution of the Cauchy problem for the pair (g, A) in 
wave and Lorenz gauge results from the general theorem quoted before. 
The proof that the constructed (g, A) satisfies the gauge conditions if 
the initial data satisfy the constraints follows lines analogous to those 
given in Section VIII.4, using, in addition, the identity VaV gF? = 0. 

Various other gauge conditions can be used on A to solve the Maxwell 
equations: for example the temporal gauge Ap = 0 or the Coulomb gauge 
V;,A' = 0. They have proved useful in different domains. 

All these gauges, like the Lorenz gauge, generalize to Yang-Mills fields. 
They lead to equations with the same principal parts as in the case of 
electromagnetism, but in this case only semilinear.*® 


VIII.12.8 Wave equation for F 


1. Show that the Maxwell equations satisfied by the electromagnetic 2- 
form F on a Lorentzian manifold (V, g) imply that F satisfies on V a 
quasidiagonal, quasilinear system of wave equations with coefficients 
depending on the curvature of g. 


2. Extend the result to the Yang-Mills case. 


Solution 


1. The Maxwell equations in vacuuo, dF = 0, ôF = 0, imply 
(ôd + dd) F = 0; (VIII.12.19) 
that is,” by straightforward computation using the Ricci identity, 
9° VaV Fay + ROFox — R$ Fap +2R°,? „Fap =0. (VII-12.20) 


2. The Yang-Mills gauge- and metric-covariant derivative of F is defined 
by 


VF := VF + [A, F], (VIII.12.21) 


with V the Riemannian covariant derivative and [.,.] the bracket in 
the Lie algebra corresponding to the considered Yang-Mills model. 
The Yang-Mills equations in vacuo are 


Va Foy + Vy Fap + VeF ya =0 (VIIT.12.22) 
and 
VaF™ = 0. (VIII.12.23) 


These equations imply the following second-order semilinear equation 
for F, depending on A: 


V°V) Fup — 2[F}, Fy] = 0, (VIII.12.24) 
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86 That is to say, only linear in the prin- 
cipal terms—in the case of the Yang- 
Mills system, the first-order derivatives. 


87See, for instance, the expression for 
this operator for an arbitrary p-form in 
CB-DMI, V B 4. 
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88Eardley and Moncrief (1982). See 
also the survey article by Choquet- 
Bruhat (1983). 


89Bel (1958). 


where 
WV a Fag = VV) Fop + 2V5[A7, Fap] — [VyA7, Fag] 
+[A,,[A7, Fap]. (VIII.12.25) 


This equation for F plays an essential role in the proof of the global 
existence of a solution of the Cauchy problem for the Yang-Mills 
equations by Eardley and Moncrief.88 


VIII.12.9 Wave equation for the Riemann tensor 


Show that the Riemann tensor of an Einsteinian spacetime of arbi- 
trary dimension satisfies a quasidiagonal semilinear system of wave 
equations®? in the spacetime metric. 


Solution 
The Riemann tensor satisfies the Bianchi identities 
VaRgy, rp + VyRag, An + VeRyo,rn = 0. (VIII.12.26) 


One deduces from this identity and the Ricci identity an identity of the 
form 


VV aRpy, au t VyV" Rag, rApwtvV eV" Rya apt Sby, Au = 0, (VIII.12.27) 
where Sig,,y,, is a homogeneous quadratic form in the Riemann tensor: 


Sby, Ap = {Ry Rpb,àn + Rey a? Rap,Au + [(R +, Rap, pu) = (A = a)l} 


et aaa 
(VIII.12.28) 
Returning to the Bianchi identities gives, by contraction, 


VaRpy:” u + VyRag n + Ve Ryo, = 0. (VIII.12.29) 
Therefore, using the symmetry Rag yp = Ryp,ap; 
VaR“ Bau + Vultag — Vi Rug = 0. (VIII.12.30) 


If the Ricci tensor Rag satisfies the Einstein equations Rag = pag, then 
the previous identities imply equations of the form 


V°VaRey vu + Say:du = day Aus (VIII.12.31) 
where Jgy,,,, depends on the sources pag and is zero in vacuum: 
Joy, Au = Va(V nla — V apup) — (8 =): (VIII.12.32) 


Note that (12.26) and (12.29) modulo the Einstein equations are 
analogous to the Maxwell equations for the electromagnetic 2-form F: 


dF=0, 6F=J, (VIII.12.33) 


where J is the electric current. 


VIII.12.10 First-order symmetric hyperbolic system 
for the Riemann tensor, Bel—Robinson 
energy 


Write in a Cauchy adapted frame a first-order symmetric differential 
system with constraints satisfied by the Riemann tensor of a (3 + 1)- 
dimensional Einsteinian spacetime. 


Solution 


In a coframe 6°,’ where go; = 0 the Bianchi identities (12.26) with 
{aby} = {ijk} and the equations (12.30) with 8 = 0 do not contain 
derivatives o of the Riemann tensor. We call them ‘Bianchi constraints’. 
The remaining equations, called from here on ‘Bianchi equations’, read 
as follows: 


VoRnkAp + VERon Au + Va Rko, Ap = 0, (VIII.12.34) 


VoR inn + Varian = VaPui — VuPri = Janis (VIIL12.35) 


where the pair (Ay) is either (07) or (jl), with j < l. There are three of 
one or the other of these pairs if the space dimension n = 3. 

Equations (12.34) and (12.35) are, for each given pair (Au, À < p), a 
first-order system for the components Rpx,,, and Rony. If we choose 
at a point of the spacetime an orthonormal frame, then the principal 
operator is diagonal by blocks; each block corresponding to a choice of 
a pair (Au, \ < u) is a symmetric 6 x 6 matrix that reads 


Oo 0 0 è & - 0 
0 æ 0 0 s -o2 
0 0 & -8 0 GH 

Op 0 -ð OA 0 0 

-01 03 0 0 æ 0 
0 -0 Oo 0 0 æ 


The numerically valued matrix M+ of coefficients of the operator 0/0t 
corresponding to the Bianchi equations relative to the Cauchy adapted 
frame is proportional to the unit matrix, with coefficient N~?, and hence 
is positive-definite; thus the proof is complete. The energy corresponding 
to the first-order symmetric hyperbolic system (12.34), (12.35) is the 
Bel—Robinson energy. 


VIII.12.11 Schwarzschild trapped surface 


Show that a 2-surface t =constant, r = ro of the Schwarzschild spacetime 
is not trapped if ro > 2m. Show that it is trapped if ro < m. 
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TAs said before (Chapter IV), this 
property motivated Einstein in the 
choice of his equations. 


Relativistic fluids 


IX.1 Introduction 


A fluid matter source in a domain of a spacetime (V, g) is such that 
there exists in this domain a timelike vector field u, called the unit 
velocity, satisfying g(u, u) = gagu°u? = —1, whose trajectories are the 
flow lines of matter. A moving Lorentzian orthonormal frame is called 
a proper frame if its timelike vector is u. In a proper frame, the unit 
velocity has components u? = 1, u’ = 0. 

One may also consider null fluids, with flow lines trajectories of a null 
vector field u, i.e. such that u%uUg = 0. 

The Einstein equations with fluid source are 


1 
Sag = Rap — 59aR = Tap, (IX.1.1) 


where the stress-energy (momentum) tensor T is deduced from the 
equivalence principle and its expression in Special Relativity (see Chap- 
ters II and II) for the considered type of fluid. The conservation 
equations 


Vat? =0, (IX.1.2) 


which in Special Relativity resulted from the physical laws of conserva- 
tion of energy and momentum, are in General Relativity! a consequence 
of the Bianchi identities. The conservation equations (1.2) must some- 
times be completed by equations satisfied by other physical quantities 
appearing in T (e.g. the electromagnetic field). 

In this chapter, we will describe general properties of perfect fluids. 
When appropriate definitions are given, a number of these properties 
generalize well-known properties of non-relativistic perfect fluids. How- 
ever, the equivalence of mass and energy in Relativity introduces a 
number of fundamental differences. The relativistic causality principle, 
limitation by the speed of light of the speed of any macroscopically 
transmitted signal, leads also to new considerations. 

We will only briefly touch on the case of dissipative fluids, which are 
still subject to controversies. 


IX.2 Case of dust 


A fluid source is called pure matter or dust if in a proper frame, it 
has neither momentum nor stresses; hence, in a proper frame, the only 


non-vanishing component of the stress—energy tensor is To) = r, the 
proper mass density. In an arbitrary frame, 


Tog =TUatg, U-Ua = —l, (IX.2.1) 


where u is the unit flow velocity. 

At usual laboratory, and often at astronomical, scales, this dust stress— 
energy tensor is a good approximation of a general fluid stress—energy 
tensor, since the mass energy is of order c? (c being the velocity of light), 
with respect to the other forms of energy. 

The conservation laws for the dust stress-energy tensor read 


Val = wVo(rut) + ru’V ou =0. (IX.2.2) 


They give, using the property uug = —1, and hence ugVqu® = 0, the 
continuity equation (conservation of matter) 


Val(ru®) =0 (IX.2.3) 
and the geodesic motion of the particles 
u*Vau? = 0. (IX.2.4) 


Similar equations are obtained for a null-dust model where u°ug = 0. 

The geometric initial data for the spacetime metric g on an initial 
manifold M are a Riemannian metric g and a symmetric 2-tensor K. 
The initial data for a dust source are a scalar function 7 on M and 
a tangent vector field ọ to M. A solution (V,g,r,u) of the coupled 
Einstein—dust equations is an Einsteinian development of the initial data 
set (M, 9, K,7,0) if g and K are respectively the induced metric and the 
second fundamental form of M as an embedded submanifold in (V, g), 
while 7 is the function induced by r on M and v is the value on M of 
the dust velocity with respect to the proper frame of an observer with 
timelike vector orthogonal to M in (V, g). In local coordinates such that 
the values on M of the shift and the lapse of the development are re- 
spectively 3 = 0 and N = 1, it holds that 0’ = (u°)~!u', where u% are 
the components of u in the considered coordinate system at points of M. 

It can be proved? that an initial data set (M, g, K,7,0) satisfying the 
constraints for the Einstein equations with dust source admits a globally 
hyperbolic and maximal Einsteinian development? (V, g), unique up to 
isometries, with dust source (r, u). 

The problem of how to use the general equations of fluid evolution to 
determine the motion of isolated bodies is a long standing one, which 
has received only very partial answers. In the pure matter case, it can 
be proved* that a solution can be found for isolated bodies, i.e. when 
the support w of 7 is the disconnected union of compact sets, by giving 
to v arbitrary values outside w (see Problem IX.20.2). 
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?Fourés (Choquet)-Bruhat (1958). For 
more refined statements on the re- 
quired regularity, see YCB-OUP2009, 
Chapter IX. 


3Non-complete in general. 


4Choquet-Bruhat and Friedrich (2006). 
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5This model is seldom valid in the real 
world, where opposite charges interact. 


IX.3 Charged dust 


The stress-energy tensor of charged pure matter (dust) is the sum of 
the stress-energy tensor of the matter and the Maxwell tensor of the 
electromagnetic field F: 


Tag = TUcts + Tap, (IX.3.1) 
with 
Tap = Fa Foa — T gagF™ Fay. (IX.3.2) 
The Maxwell equations are, 
dF=0 and V-F=J, ie. VaF°®%® = J? := quf,  (IX.3.3) 


where J is the convection electric current of the charge density q. They 
imply the conservation of electric charge: 


Valgu“) = 0. (IX.3.4) 


We introduce the electromagnetic potential A, a 1-form such that (for 
simplicity, we suppose that A exists globally on the considered domain) 


F = dA. (IX.3.5) 
We take A in Lorenz gauge, i.e. such that 
dA =0. (IX.3.6) 
The Maxwell equations then read as a wave equation for A, namely 
Vad" AP — RP A* = JP = qu’. (IX.3.7) 
Modulo the Maxwell equations (and u“%ua = —1), the stress-energy 
conservation equations are equivalent to 
Valru”):=0 (IX.3.8) 
and 
ru°V que + quF® =0. (IX.3.9) 


Equations (3.4) and (3.8) imply 


wra,(2) <0 


that is, the specific charge q/r is constant along the flow lines; it is 
constant throughout the spacetime that we construct if it is constant ini- 
tially. We will make this simplifying (though not necessary) hypothesis, 
and set 


q = kr, 
with k some given constant. Equation (3.9) can then be replaced by 


uV ou? + ku, FP = 0. (IX.3.10) 
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The proof of the existence of a solution (local in time) of the Cauchy 
problem for the full Einstein—Maxwell—dust system when the initial data 
satisfy the Einstein and Maxwell constraints with dust source follows the 
same lines as in the vacuum Einstein—Maxwell case. Geometric global 
uniqueness can also be proved. 


IX.4 Perfect fluid 


IX.4.1 Stress—energy tensor 


We have written in Chapter IV the stress-energy tensor T of a perfect 
fluid in General Relativity: 


Tap = Ugg + p(gas + Uatg), (IX.4.1) 


where u is the unit flow vector, while u and p are respectively the energy 
and pressure densities. In a proper frame, it holds that Too = u, Toi = 0, 
and Tj; = pdi;, the isotropic-in-space pressure tensor. For classical fluids, 
H and p are non-negative.® 6Quantum or cosmological phenomena 


can lead to the appearance of negative 
Exercise IX.4.1 Show that in a spacetime of dimension n+ 1, the pressures. 


Einstein equations with a perfect fluid source can be written as 


: 1 
Rag = Pap, with Pap = (u + p)uaug T z gee = p). 


—1 
Hint: We have shown in Chapter IV (equation IV.2.5) that, with n the 
dimension of space (in the classical case, n = 3), 


1 
Pap = Tap — gas Ts 


We have here 
T = 9° Tyg = np — u. (IX.4.2) 
The energy—momentum vector relative to a timelike vector X is 
PE := TP Xo = (w+ p)utu? Xg + pX“. (IX.4.3) 


For X = —u, the components of P are P? = u, the energy density, and 
P* = 0, zero momentum density. 

We mention the following properties, which use a terminology fre- 
quently referred to, although misleading since weak and strong condi- 
tions are unrelated in the following propositions. 


Proposition IX.4.1 If u and p are non-negative, while X and Y are 
causal with the same time orientation, then 


1. The scalar TP XaYg is non-negative; one says that a perfect fluid 


satisfies the weak energy condition. TRecall that the Einstein equations can 


2. The scalar’ pagX®“X® is non-negative if X is causal; such a perfect be Suron as Rag = Paps Pap i= 
fluid satisfies the strong energy condition. Tab — gail Gap: 
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8See Taub (1959). 


3. If u >p and if X is past timelike, then the energy-momentum vec- 
tor Px is future timelike like u; such a perfect fluid satisfies the 
dominant energy condition. 


Exercise IX.4.2 Prove these properties. 


Hint: Compute these quantities in a proper frame and for Part 2 use the 
property (X°)* > S0(X")?. 


IX.4.2 Euler equations 
The conservation laws of a perfect fluid read 
VaT”? = (u + pju Vau? + 9g? dap + uP [Valu + p)u™] =0. (IX.4.4) 


By taking the contracted product with ug, one deduces from these 
equations the energy equation 


(u+ p)Vau® + u“ðau = 0, (IX.4.5) 
and, using this equation, one obtains the equations of motion 
(u+ p)utV aul + (9% + utu?)dop = 0. (IX.4.6) 
The set of equations (4.4), (4.5) is called the Euler equations. 


Exercise IX.4.3 Show that the condition u~u,. = —1 is conserved 
along the flow lines. 


Hint: The equations of motion imply uO, (u?ug) = 0. 


IX.5 Thermodynamics 


IX.5.1 Conservation of rest mass 


In relativity, mass and energy are the same entity. However, there exists® 
another scalar function of physical interest, namely the particle number 
density r (better called the rest mass density if there are different kinds of 
particles with non-zero rest mass). In the absence of chemical reactions 
or quantum phenomena, conservation of particle number implies the 
equation 


Val(ru’) =0. (IX.5.1) 


The quantity r~! plays the role of a specific volume. 


IX.5.2 Definitions. Conservation of entropy 


The difference between the total energy density u and the rest mass 
density r is called the internal energy density. One denotes by e the 
specific internal energy density; that is, one sets 


pw=r(lt+e). (IX.5.2) 


In the case of local thermodynamic equilibrium (reversible thermo- 
dynamics), one defines a specific entropy density S and an absolute 
temperature T by extending to relativistic perfect fluids the identity of 
the first law of thermodynamics, namely 


TdS := de + pd(r—*). (IX.5.3) 


The thermodynamic quantities p,p,S, and T are spacetime scalar 
functions. 


Theorem IX.5.1 In a perfect fluid, the thermodynamic identity (5.3) 
and the matter conservation equation (5.1) imply the conservation of the 
entropy density along the flow lines:® 


uas = 0. (IX.5.4) 
Proof. The identity (5.3) and the definition (5.2) of € give that 
Tu ða S = uae — r’ pu ðar = ur Oop — (ut pr ’u "ðar, 
and hence, using the energy equation (5.1), 


Tua S = -r?° (u + p)Valru®). a 


IX.5.3 Equations of state (n = 3) 


For a perfect fluid, only two thermodynamic scalars are independent; 
the others are linked to them by relations that are assumed to depend 
only on the nature of the given fluid. Usually, the general formula called 
the equation of state is the data of p as some function of u and S: 


p=p(S, u), 


invertible as a function u = u(p, S). 

Two circumstances are of particular physical interest for General 
Relativity, astrophysics, and cosmology. 

In astrophysics, one is inspired by what is known from classical flu- 
ids, with additional relativistic considerations. Particularly interesting 
cases are those of barotropic and polytropic fluids. 


Barotropic fluids 
A fluids is called barotropic when the equation of state reduces to 
p= p(y). 


The fluid dynamics is then governed by the energy and momentum equa- 
tions. The particle number conservation equation decouples from the 
others and can be solved after the fluid motion has been determined. 
Some physical situations that correspond to this model are the following: 


(1) Very cold matter, including models of nuclear matter, such as in 
neutron stars. 


IX.5 Thermodynamics 


See Pichon (1965). 
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10Tn particular by analysing data from 
the Planck satellite (see Chapter VII). 


(2) Ultrarelativistic fluids, i.e. fluids where the energy p is largely dom- 
inated by the radiation energy. Then, by the Stefan—Boltzmann law, 
it holds that u = KT* and p = sKT*, and hence 


1 
P= 5h. (IX.5.5) 


The same equations for p and p (with different constants A’) and the 
equation of state (5.5) are valid for a fluid of neutrinos or electron— 
positron pairs. 


Remark IX.5.1 The stress—energy tensor of an ultrarelativistic fluid 
is traceless. 


Polytropic fluids 
A fluids is called polytropic if it obeys an equation of state of the form 


p= f(S)r’. 


Several physical situations correspond to this case. 
Frequently in stellar situations, only the internal energy £ and pressure 
p are dominated by radiation; then £ = KrT* and 


1 1 
p= ght", hence p= 37E (IX.5.6) 


On the other hand, the thermodynamic identity together with the 
expressions £ and p imply 
4K AKT? 


dS = —-d(r'T"), hence S = F 


(IX.5.7) 


Eliminating T between (5.6) and (5.7) gives the polytropic equation of 

state of index y = $ i 
_K (38 

P= 3 (1K 


4 
F 
3 ) r3, with p=3p+r. (IX.5.8) 


More refined equations of state adapted to various physical situations 
have also been considered. 

In cosmology, new information obtained from modern ground-based 
and satellite telescopes has brought puzzling questions. It has been 
known for a long time that radiation energy is presently only a very 
small fraction of the energy content of the cosmos, but it has recently 
been found!” that baryonic matter itself represents at present only about 
4% of this energy content. Another 24% is constituted by dark matter, 
whose nature is conjectural, and the remaining 72% by what is called 
‘dark energy’, whose nature is a mystery. 

In the early universe of the big-bang models, at very high tempera- 
ture, the energy content of the sources can be roughly modelled as an 
ultrarelativistic fluid. For later times, after the formation of galaxies, 


IX.6 Wave fronts and propagation speeds 215 


cosmologists in general have assumed, for simplicity, a linear equation 
of state independent of entropy: 


p=(y- l)u. (IX.5.9) 


In order that the speed of sound waves not be greater than the speed of 
light (see Section IX.6), it has been supposed that 1 < y < 2. The case 
y = 1 corresponds to dust. In the case y = 2, the fluid is called stiff, or 
incompressible. In a stiff fluid, the speeds of sound and light are equal. 


IX.6 Wave fronts and propagation speeds 


In the spacetime of Galilean—Newtonian mechanics a wave front is a 2- 
surface in space that propagates with time. Its propagation speed at a 
point of space and an instant of time is the quotient by an infinitesimal 
absolute time interval dt of the infinitesimal distance between two wave 
fronts measured in the direction orthogonal to them, at times t and 
t + dt. In an Einsteinian spacetime, a wave front is a three-dimensional 
timelike or null submanifold. The definition of its propagation speed 
depends on the choice of an observer, and requires some thought. 
Let us give first some definitions. 


IX.6.1 Characteristic determinant 


The characteristic polynomial of a linear partial differential equation 
at a point x € R”*! is the polynomial obtained by replacing in its 
higher-order terms the partial derivatives 0/Ox% by the components of 
a covariant vector. 

In the case of a system of N first-order partial differential equations!! 
with N unknowns U! and principal part 


N 
aut 
Sc Apts, J=1,...,N, 
Ox 
I=1 
the characteristic polynomial ®(z, p) is the determinant of the N x N 
ý : Jia 
matrix with elements AZ" pg : 


(x, p) := Det [A7*(@)pa| : 


At a given point x, the equation ®(2,p) = 0 defines a cone in the 
cotangent space called the characteristic cone. 

The wave fronts associated to a system of first-order partial differ- 
ential equations on a spacetime are the submanifolds of this spacetime 
whose normals are roots of the characteristic determinant; that is, wave 
fronts satisfy in local coordinates an equation f = 0, where f is a solu- 
tion of the first-order nonlinear partial differential equation, called the 
eikonal equation, 


lFor the definition for more general 
systems, see, for instance, YCB- 
OUP2009, Appendix IV, Section 2. 
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12See for instance CB-DMI, IV Chap- 
ter 7. 


13Two 3-hyperplanes in four- 
dimensional spacetimes intersect 
generically along a 2-plane whose 
normal in a 3-plane is a vector. 


Ify 


Generically, discontinuities of the derivatives of the unknowns of the 
order appearing in these principal parts can occur only across such 
submanifolds. 

By general results on first-order partial differential equations, !? a wave 
front is generated by the bicharacteristics of the eikonal equation, also 
called rays, which are solutions of the ordinary differential system 

dx dpa 


= = IX.6.1 
o a a 


where À is a parameter, called the canonical parameter. The tangents 
to the rays issuing from a given point x generate a cone in the tangent 
space to the spacetime at x, called the wave cone, which is dual to the 
characteristic cone. The wave cone is the envelope of the hyperplanes 
whose normals (in the metric g) belong to the characteristic cone. A 
wave front at x is tangent to the wave cone at x along the direction of 
a ray. 


Exercise IX.6.1 Show that P(x, p) is constant along a ray. 


Exercise IX.6.2 The eikonal equation associated to the wave operator 
g on a spacetime (V, g) is 


Lino OF OF 
2 Ox Ox 


0. (IX.6.2) 


Show that the light cone and the characteristic cone of a spacetime (V, g) 
can be identified by the usual identification of the tangent and cotangent 
spaces to V through the metric g. 


IX.6.2 Wave front propagation speed 


The propagation speed of a smooth wave front with respect to an obser- 
ver is the propagation speed of the tangent plane to the wave front with 
respect to the proper Lorentz frame of this observer, with the following 
definition (one could give analogous definitions for higher dimensions, 
but to be clearer we prefer to stick to 3 + 1). 


Definition IX.6.1 The propagation speed at a point x of a three- 
dimensional hyperplane P with respect to an orthonormal Lorentzian 
frame at x is the velocity with respect to this frame of the vector of 
greatest slope in P, orthogonal to the 2-plane!® intersection of P with 
the space hyperplane X° = 0. 


Lemma IX.6.1 The propagation speed V of a hyperplane with normal 
v with respect to a Lorentzian frame at x with time vector u is given by 
the formula 
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TA 


(ucut + gare 


v= (IX.6.3) 


Proof. The vector of greatest slope in P with respect to the Lorentz 
frame with time vector u is the vector in P orthogonal to the 2-plane I 
that is the intersection of P and the hyperplane orthogonal to u. This 
intersection I satisfies the equations 


u°X,=0, v“Xa=0. (IX.6.4) 


Choose for the first space vector in the Lorentzian frame the projection 
of v on the hyperplane orthogonal to u; then (6.4) read 


Xo=0, v°Xot+v'X, =0. (IX.6.5) 


A vector in I therefore has components Xo = 0, Xı = 0, with X2, X3 
arbitrary. The vector of greatest slope Y in P orthogonal to I is uniquely 
determined by the conditions 


0 
YeP, ie v°Y+v!¥,=0, hence Y= = te (IX.6.6) 


and the orthogonality with J, that is, Yo = Y3 = 0. 
The propagation speed of P with respect to the considered Lorentz 
frame is 


vi=|#l|= 


: (IX.6.7) 


yl 


The given formula (6.3) takes the form (6.7) in a Lorentz frame where 
the time axis is the unit vector u and the normal v to P has components 
V2 = V3 = 0. E 


IX.6.3 Case of perfect fluids 


The Euler and entropy conservation equations of a perfect fluid are of 
first order. Using the entropy equation (5.4) in the energy equation (4.5), 
we see that the Euler—entropy equations written for the unknowns S, p, u 
have a characteristic matrix composed of two blocks around the diagonal. 
One of these blocks corresponds to the entropy S and its conservation 
law (5.4); it reduces to the linear form! 


a=u°X,. 
The other block corresponds to the unknowns p and u and the equa- 


tions (4.4) and (4.5). It is the following 5x5 matrix’ with p = p + p, 
Hp = Op/Op: 


14T> avoid confusion with pressure, 
we denote by X the vector previously 
denoted by p. 


15The four components of u are con- 
sidered as independent unknowns. The 
identity g(u,u) = —1 is preserved by 
the flow. 
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16Recall that we use geometric units; 
i.e. the speed of light is equal to 1. 


17That is, linear in the highest-order, 
m + 1, terms. 


a PH Xo pu, Xı p, X2 PHXs 
X? + au? pa 0 0 0 
M= | X!+au! 0 pa 0 0 
X? + au? 0 0 pa 0 
X? + au’ 0 0 0 pa 
The determinant of this matrix is computed to be 


— pta? D, (IX.6.8) 
with D quadratic in X and given by 
D = (up — 1) (U Xa)? + pX” Xa. (IX.6.9) 
We see that a perfect fluid has two types of wave fronts: 
e Matter wave fronts, f = constant, such that 
u-Oof = 0. (IX.6.10) 


They are submanifolds generated by the flow lines. Their propagation 
speed for a comoving observer (i.e. in a proper rest frame of the fluid) 
is zero. 


e Sound wave fronts, whose normals satisfy D = 0. In a proper rest 


frame gog = Nag (the Minkowski metric), u? = 1,u’ = 0. The 
corresponding eikonal equation reads 
— (of)? + X0: f)? = 0. (IX.6.11) 


The propagation speed of these wave fronts is, assuming p, > 0, 


=l 
|V| = (u) ž; 


it is less than the speed of light,'® as expected from a relativistic 
theory, if and only if 


u >i (IX.6.12) 


A fluid such that u, = 1 is called an incompressible or stiff fluid. 
In such a fluid, sound waves propagate with the speed of light. 


IX.7 Cauchy problem for the Euler 
and entropy system 


The Cauchy problem is the determination of the solution of a partial 
differential system from its initial data. Consider first one linear!” partial 
differential equation of order m+1 with unknown a function U on R"*+! 
where coordinates are denoted by 2°, x’, i=1,...,n. The initial data 
for U on the submanifold z? = 0 are the values for x° = 0 of U and 
its partial derivatives with respect to x° of order less than or equal to 
m. A Cauchy problem is said to be well posed, for initial data in some 
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functional space, if it admits in a neighbourhood of x° = 0 one and only 
one solution depending continuously on the initial data. The equation is 
called hyperbolic if it admits a well-posed Cauchy problem with data 
in spaces where only a finite number of derivatives are involved!® and 
if the solution of this Cauchy problem manifests a finite propagation 
speed. Hyperbolic well-posedness depends only on the coefficients of the 
principal terms, i.e. terms of order m+ 1. For a quasilinear differential 
equation (i.e. one that is linear only in the highest (m + 1)-order terms), 
the hyperbolicity depends on the values of the coefficients of these terms 
for the given initial data. By contrast, if the equation is semilinear 
(i.e. the coefficients of the highest-order terms do not depend on the 
unknowns), hyperbolicity does not depend on the initial data.!? 

In General Relativity, causality is an essential feature, saying that 
the properties of physical phenomena at one point depend only on the 
past of this point, determined by the causal paths?° of the Lorentzian 
spacetime metric g. The definitions of Cauchy problem and hyperbolicity 
extend to systems of partial differential equations, in particular to qua- 
sidiagonal systems. The wave gauge reduced vacuum Einstein equations 
are a quasidiagonal and quasilinear second-order partial differential sys- 
tems as studied in Chapter VIII; they are a hyperbolic system for data 
on a Lorentzian metric on a spacelike submanifold. They are a special 
case of a quasidiagonal Leray hyperbolic systems.7! 


IX.7.1 The Euler and entropy equations as a Leray 
hyperbolic system 
It can be shown”? that the Euler equations are equivalent to a quasidi- 


agonal system with principal terms in the diagonal either w°O, or the 
third-order operator 


u® [(u, — Luu? + uhg*?] 0,030.- (IX.7.1) 
The characteristic cone at a point is 
u*Xq [(u, — 1)uru? + ug] Xa Xp = 0. (IX.7.2) 


It is the union of a hyperplane u“Xa = 0, spacelike if u is timelike, 
reading (in a proper frame for u) 


(P) = {Xo = O}, 


and a second-order cone (C), reading (in a proper frame for u) 


(C) = {—(Xo)? + p YX? = 0}. 


It is easy to see by drawing a figure that if p, > 0, then any straight 
line passing through a point y in the interior of (C), which is 


—(yo)? + u, So (ui)? <9, 


18Mathematicians often use Sobolev 
spaces Hs. 


19One calls hyperquasilinear equations 
of order m + 1 in which the coefficients 
of the highest-order terms do not de- 
pend on derivatives of the unknowns of 
order > m — 1. The Einstein equations 
are hyperquasilinear. 


20In other words, physicists say that 
the speed of light is a maximum for the 
propagation of all observables. 


21 Leray (1953). See also Dionne (1962): 
this article by a student of Leray re- 
fines his results in an important way. 
It treats only the case of one equa- 
tion, but the results extend trivially to 
quasidiagonal systems. 


?2Choquet-Bruhat (1966). 
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23Note that this geometric criterion re- 
fers to cones in the dual of the tangent 
space to the spacetime manifold. 


24For details, see YCB-OUP2009, Ap- 
pendix IV, Section 2 or Leray (1953). 


25 Friedrichs (1954). The Leray hyper- 
bolic systems are much more general, 
containing equations of arbitrary or- 
ders, but some symmetric hyperbolic 
systems do not satisfy the criteria for 
Leray hyperbolicity. 


26 Anile (1989). 


27A FOSH system, for barotropic flu- 
ids, was found in Special Relativity 
by K. O. Friedrichs. Its construction 
in General Relativity was sketched by 
Rendall (1992). 


but not passing through the vertex, cuts the union (P) U (C) at three 
distinct points.?? This is the criterion of Leray hyperbolicity for a third- 
order operator, or a quasidiagonal system of such operators.?4 


IX.7.2 First-order symmetric hyperbolic systems 


An alternative to the Leray approach to proving the well-posedness 
of a Cauchy problem for evolution equations is the Friedrichs the- 
ory?” of first-order symmetric hyperbolic (FOSH) systems. The 
first-order linear system of partial differential equations on R”+t, 


OU? OU? 
Mij ot Ox? 


Mi; = fr, 

is called symmetric if the n+1, N x N matrices Mt, M’ with elements 
Me, Mj, are symmetric (i.e. M%, = MG). It is hyperbolic with respect 
to the x? = t coordinate, and is then described as simply symmetric 
hyperbolic, if, in addition, the matrix Mt is a positive-definite quad- 
ratic form. The Cauchy problem is then well posed if its initial data 
are given on a submanifold Sọ := {x° = constant}. In a quasilinear 
system, the matrices M“ are functions of the unknowns U, but not of 
their derivatives. The symmetric hyperbolicity depends then on the val- 
ues of these unknowns. The Cauchy problem with data on So is then 
well posed if the system is symmetric hyperbolic when in the coefficients 
the unknowns are replaced by their initial data. Note that in general, 
however, the solution exists only in a neighbourhood of So. 

It is straightforward to write for wave equations equivalent FOSH sys- 
tems (see Problem VIIT.12.10 in Chapter VIII). In the case of the Euler 
equations, the procedure is more subtle but may be preferred for some 
numerical computations. To obtain FOSH systems, one can apply gen- 
eral methods inaugurated by Lax, developed by Boillat and Ruggeri, and 
explained in the book by Anile.?° These authors use a convex functional 
and auxiliary unknowns. In the case of relativistic perfect fluids, one 
can perform a direct computation to show the equivalence of the Euler— 
entropy system to a FOSH system.?” The idea is to take as unknowns 
the pressure p, denoted by U°, and the space components ut, denoted 
by U!, I = 1,2,3. The component u? is determined through the identity 


u"Ug = —1. (IX.7.3) 
From this identity, one deduces 
ee al (IX.7.4) 
uo 


The energy—continuity equation yields the following first-order evolution 
equation for p and uw’: 


ui Vou’ Lulu? ap 


Viu’ = 
f P p+p 


IX, 
< (IX.7.5) 
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The combination of the equations of motion of indices 0 and å, 


i N) i 
u“Vau’ + (ge + aS rar 7 — s jeevan + (9%? + u%u?) 


oap 7 
+p 


9 


reduces to another first-order evolution equation for p and ut: 


a a. Oa 
ue (Vou! + “Mj Vou) (~ s e Fa (IX.7.6) 


ulug yo H + p 
Setting A := (u+ p)~+, the principal matrix of the system (7.5), (7.6) reads 
M = M°dq, 
A? upu do A (a z a) A (a = “200) A (a = 22a) 
1 ie z pan 
Aloe a TeSa Sura, Z3 Da 
= uo uuo ulug uuo 
— 2 1 2 2 
A (> 7 9p a uO, ge (1 n =i dn ous 3, 
as U MD i uuo u` uo 7 
u u“ uo u“ ugo u` uo 
(IX.7.7) 


It can be proved that the system is equivalent to a symmetric system 
with 09 = 0/0t and with Mt positive-definite?® if u is timelike and ?8See YCB-OUP2009, Chapter IX, 
u, > 1; that is, to a FOSH system. Section 13. 


Exercise IX.7.1 Show that (7.5), (7.6) are equivalent to the original 
Euler—entropy equations. 


IX.8 Coupled Einstein—Euler—entropy 
system 


IX.8.1 Initial data 


An initial data set for the Einstein—Euler system with a given equation 
of state will be the usual data for the Einstein equations together with 
data for the fluid source. 

We have seen that an initial data set for the Einstein equations is a 
triple (M,g, K) with M a 3-manifold, g a properly Riemannian metric, 
and K asymmetric 2-tensor on M. A spacetime (V, g) is said to admit 
these initial data if M can be embedded in V as a submanifold Mọ with 
induced metric g and extrinsic curvature K. If (V, g) is a solution of the 
Einstein equations with source the stress-energy tensor T, then (g, K) 
must satisfy the following constraints: 


R(g) —K-K + (tr K)? = 2p, (IX.8.1) 


V-K-VtrK = J. (IX.8.2) 
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29See Lichnerowicz (1955) and refer- 
ences therein. The dynamical velocity 
was used by Fourès (Choquet)-Bruhat 
(1958) for the first proof of the well- 
posedness of the Cauchy problem for 
relativistic perfect fluids, coupled or 
not with the Einstein equations. 


In a Cauchy adapted frame, where the equation of M = Mo in V is 
x? = t = 0 and the time axis is orthogonal to Mo, one has, in the case 
of a perfect fluid, 


p = NPT” = N? (u + p) (u?Y? — p, (IX.8.3) 
J’ = NT® = N(u + pjutut, 


Nu? = (1 + gijutut) 3. (IX.8.4) 
Using u“ua = —1, that is, 
Nu? = (1 + gijuiu)?, (IX.8.5) 


we see that on Mo the quantities p and J depend only on the values jf 
and p of u and p on Mo, and on the projection v of u, with components 
vt = ut, on Mo; they do not depend on the choice of lapse and shift. The 
initial data on Mp for a perfect fluid with equation of state u = u(p, S) 
are two scalars p and § and a tangent vector U. Data for the third-order 
system are obtained by using the restriction to Mo of the Euler equations 
and its first-order derivative in the direction of u. This computation now 
requires a choice for initial lapse and shift, since this is also required for 
the solution of the reduced Einstein equations. 


IX.8.2 Evolution 


The Einstein equations reduced in harmonic gauge, with perfect fluid 
source, read as a second-order quasidiagonal system for the metric whose 
principal part is the wave operator of the spacetime metric while the 
fluid variables appear at order zero. It is straightforward to show that, 
together with the Euler—entropy equations, they form a Leray hyperbolic 
system provided that u, > 0. However, the system is causal (i.e. the 
domain of dependence is determined by the light cone) only if u, > 1. 
It is also straightforward to write in that case the coupled system as 
a FOSH system. We have already described in Chapter VIII the proof 
that a solution of the reduced system is a solution of the full system if 
the initial data satisfy the constraints. 


IX.9 Dynamical velocity 


IX.9.1 Fluid index and Euler equations 


Important properties of non-relativistic fluids generalize to relativis- 
tic fluids if one introduces a spacetime vector called the dynamical 
velocity?” linked with both the kinematic unit vector u and the 
thermodynamic quantities. 


In the case of barotropic fluids, i.e. with equation of state u = u(p), 
the simplest way is to define a function f(p), called the index of the 
fluid, by 


f(p) := exp be S (IX.9.1) 
po H(p) + p 
and the dynamical 4-velocity by 
Ca = fua, hence C%C, =—f?, CÊVaCg = -—fôaf.  (IX.9.2) 


Theorem IX.9.1 For a barotropic fluid, the Euler equations (4.4) and 
(4.5) are equivalent to the equation 


C°(VaCg — VeCa) = 0 (IX.9.3) 
and 


CEC. ay, Ol 
ORON VaCg = 0, with Hp = Op’ 


Exercise IX.9.1 Prove this theorem. 


Exercise IX.9.2 Show that the flow lines of a perfect fluids are 
geodesics of a metric confomal to the spacetime metric. 


Val? + (uh, — 1) (IX.9.4) 


Hint: Show that g = Fg implies 
C°V Cg = C°VaCg + C%C,0g(log F). 
Corollary IX.9.1 A barotropic relativistic fluid is incompressible 
(u, = 1) if and only if”? 
Val” =0. (IX.9.5) 


Assuming an equation of state and expressing p as a function of f, 
and hence of C, the equations (9.4), (9.5) read as a first-order differential 
system for C. 


Exercise IX.9.3 Write these equations for the equation of state 
p=(y- De. 


IX.9.2 Vorticity tensor and Helmholtz equations 


The vorticity tensor is defined through the dynamical velocity as the 
antisymmetric 2-tensor 


Qag = Ve V 8Co. 


It results from the equations of motion (9.3) that this tensor is 
orthogonal to the velocity; indeed, these equations read 


Cag = 0. (IX.9.6) 


Theorem IX.9.2 (relativistic Helmholtz equations) The Lie derivative 
of the vorticity tensor Q with respect to the dynamic velocity C vanishes: 


LoQ= 0. 
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30This property of incompressible flu- 
ids generalizes the classical one for 
Newtonian fluids, 0;v’ = 0, implied in 
this case by the constancy of density. 


224 Relativistic fluids 


Proof. Since Q is an exterior 2-form, the differential of the 1-form C, it 
is a closed form, satisfying in local coordinates the identity 


V tly EV get Vga = 0, 
and hence 
COV apy + CPV Nag + CPV gQya = 0. (IX.9.7) 
The equations of motion (9.3) imply, by the derivation V4, 
COV lag + Nag V4 = 0. 
Using this equation, the identity (9.7) gives the equation 
CPV apy — NapVyC* — QyaV gC = 0; 


that is, using the expression for the Lie derivative of a covariant tensor 
and the antisymmetry of Q, 


(LoQ) ay = CPV eQpy + VCO OQ Ba + VBC Nay = 0. (IX.9.8) 
Oo 


We have shown that the vorticity tensor Q satisfies a linear differen- 
tial homogeneous system along the flow lines, and hence we have the 
following corollary: 


Corollary IX.9.2 If a smooth (barotropic) flow has vanishing vorti- 
city on a 3-submanifold transversal to the flow lines, it has a vanishing 
vorticity on the domain of spacetime spanned by these flow lines. 


IX.9.3 General perfect fluid enthalpy h 


For non-barotropic fluids with equation of state depending on the 
entropy S, one introduces the enthalpy h, defined through the thermo- 
dynamic identity 


dh = r™'dp + TdS, (IX.9.9) 
or, equivalently, because of the thermodynamic identity (5.3), 
dh = d|r (u+ p)], weset h:=r '(u+p), (IX.9.10) 
and now define the dynamical velocity by 
C® := hu, setting Qag := VaCg — VeCa. (IX.9.11) 
Exercise IX.9.4 Show that 
C°V gCo = —hdgh. (IX.9.12) 
Exercise IX.9.5 Show that the equations of motion are equivalent to 


(AT) +O Ogg = 8gs. (IX.9.13) 


IX.10 = Irrotational flows 


IX.10.1 Definition and properties 


A flow with zero vorticity is called irrotational, its trajectories are 

orthogonal hypersurfaces, since the equations V_C'g — VgCq = 0 imply 

that there exists on spacetime (at least locally) a function ® such that 
Ca = Oa®. 

Theorem IX.10.1 The irrotational flow of a perfect fluid is governed 


by a quasilinear wave-type equation.’ 


Proof. When the flow is irrotational, (9.3) is satisfied identically while 
(9.4) reads 

[gP + (1 — u )uu?] Vade® = 0. (IX.10.1) 
In this equation, the quantities u and p are given functions of ® and 0®. 
Indeed, by definition, 


“p 
gabs =- f? and u= 2 


while p can be expressed in terms of f, and hence of 0®, by inverting 
the relation (9.1). 

The characteristic cone of this quasilinear second-order differential 
equation for ® is given by the quadratic form 


whose dual is the sound cone. This quadratic form is of Lorentzian signa- 
ture if and only if u, > 0; the equation satisfied by is then hyperbolic. 
It is causal (the sound cone is interior to the light cone) if w,>1. m 


Remark IX.10.1 In the case of a stiff fluid, p, = 1, the equation for 
® reduces to the usual linear wave equation. 


Exercise IX.10.1 Check the quadratic form (10.2) using (6.9). 


IX.10.2 Coupling with the Einstein equations 


The Einstein equations in harmonic gauge with source given by an ir- 
rotational flow, together with the equation for ®, form a second-order 
quasidiagonal system whose principal parts are the wave operator of the 
spacetime metric or the sound wave operator. It is a causal hyperbolic 
system if u, > 1. 


Remark IX.10.2 The solution can be interpreted as an Einsteinian 
spacetime with source an irrotational flow as long as O® is timelike, i.e. 
gL ða POg® < 0. 


Exercise IX.10.2 Formulate the Cauchy problem and a local exist- 
ence theorem for the Einstein equations coupled with a perfect fluid in 
irrotational flow. 


IX.10 Irrotational flows 


31 Fourès (Choquet)-Bruhat (1958). 
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32Cattaneo (1959), Ferrarese (1963), 
and Ferrarese and Bini (2007). 


33 Friedrich (1998). 


34See YCB-OUP2009, Chapter IX, 
Section 14, or Choquet-Bruhat and 
York (1998), which uses the Riemann 
tensor, instead of the Weyl tensor used 
by Friedrich. 


35 Christodoulou (2007). 


IX.11 Equations in a flow-adapted frame 


It was stressed long ago by Cattaneo and Ferrarese’? that the natural 
physical quantities are timelines and not spacelike submanifolds. Using 
the same kind of formalism as given in Chapter VII for Cauchy adapted 
frames, but for Lorentzian frames with time axis tangent to the timelines 
(time-adapted frames) Friedrich?’ has written the perfect fluid equations 
as a symmetric system for the pressure and the connection coefficients. 
This system is hyperbolic with respect to a time function, but only under 
additional appropriate conditions.*4 


IX.12 Shocks 


A shock is a discontinuity in the fluid variables across a timelike n- 
manifold (n = 3 in the classical case). The stress—energy tensor is then 
discontinuous; its derivative is meaningful only in a generalized sense. 

The relativistic Rankine—Hugoniot equations express the van- 
ishing of the divergence of the stress-energy tensor in the space of 
generalized functions (distributions); they are 


a(T*"] = 0, 


where n is the spacelike normal to the timelike shock front © and [T°?] 
is a measure with support X. A deep study of the formation of shocks 
for fluids in Special Relativity has been made by Christodoulou.*° 


IX.13 Charged fluids 


IX.13.1 Equations 


The stress-energy tensor of a charged perfect fluid is the sum of the 
stress—energy tensor of the fluid variables and the Maxwell tensor of the 
electromagnetic field F: 


Top = Teg) + Taps (IX.13.1) 
with 
fluid 
Top) := (u + p)uaug + pgag (IX.13.2) 
and 
A 1 Au 
Tag = Fa" Fa, — 79apF "Pau (IX.13.3) 


The state of the charged fluid is governed by the Einstein equations 
Sop = Tap, (IX.13.4) 
the Maxwell equations, with J the electric current, 


dF=0 and V. F=J, ie. VafF® =J”, (IX.13.5) 


and the conservation equations, which are 
Val? = ValTaaiay +7) = 0. (IX.13.6) 
Using previous formulas, we see that these conservation equations read 
(ut pu Vau? + gap + uP [Valu + pju] + Jy F> = 0. (IX.13.7) 


Classically, the electric current J is the sum of a convection and a 
conduction current: 


Jy = qua + 0E), By = u” Fiz (IX.13.8) 


where q is a scalar function (the electric charge density), ø is the electric 
conductivity (in general assumed to be constant), and EF is the electric 
field. 

The current J satisfies the conservation equation 


VaJ® =0. (IX.13.9) 


As for uncharged fluids, taking the contracted product of (13.7) with u 
gives a continuity equation 


(u+ p)Vqu® +u“ðau — oE" Ea = 0 (IX.13.10) 
and equations of motion 
(u+p)ueV qu? +g doptul (u°dap+o E% Ea) +J FP = 0. (IX.13.11) 


Modulo initial conditions, as in the vacuum case (see Chapter VIII), the 
Maxwell equations are equivalent to wave-type equations 


(dé + ôd)F = dJ. (IX.13.12) 
One can add to these equations the law of particle number conservation 
Val(ru*) =0 (IX.13.13) 


and consider that u is a function of p and r. If the equation of state is 
given by p as a function of the pressure p and entropy S, then (13.13) 
must be replaced by a thermodynamic law. 


IX.13.2 Fluids with zero conductivity 


When the conductivity is zero, the Lorentz force reduces to gE, which is 
orthogonal to the flow vector u and hence does not furnish any work. One 
says that the fluid is non-dissipative. The first law of thermodynamics 
and particle number conservation then imply, as in the uncharged case, 
the law of conservation of entropy: 


u“ða S = 0. 
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36 Choquet-Bruhat (1965). 


The Euler-entropy equations are then expected to be a hyperbolic 
system. This is confirmed by the following theorem: 


Theorem IX.13.1 The Einstein equations in wave gauge with sources 
an electromagnetic field of potential in Lorentz gauge and a charged per- 
fect fluid with equation of state u = u(p, S) and with zero conductivity 
(o = 0) are, if m, > 0, a hyperbolic system for g, q, u, p, and the 
electromagnetic potential A. This system is causal if Bp Pel 


A solution of the Cauchy problem for the full Einstein—-Maxwell— 
charged fluid system, for initial data satisfying the constraints, can be 
deduced by methods analogous to those used for uncharged fluids. Apart 
from the addition of electromagnetic wave fronts, the wave fronts are the 
same as for uncharged fluids. 


IX.13.3 Fluids with finite conductivity 


When the conductivity ø is finite, the Lorentz force is no longer orthog- 
onal to u: it works under the fluid flow. The fluid is called dissipative and 
its properties are expected to be different from those of non-dissipative 
fluids. Indeed, experiments show, for example, that no shock wave propa- 
gates in charged mercury, a liquid with non-zero conductivity. This fact 
suggests that for such a fluid, the Cauchy problem is not well posed in 
spaces of functions with a finite number of derivatives. Indeed the first 
law of thermodynamics now gives the entropy equation (S increases 
along the flow lines if ø > 0, as physically predicted) 


Tu 0.5 = o E” Ex. 
The equation expressing conservation of the electric current J, 
Va(qu® + ou, F) = 0, (IX.13.14) 


contains derivatives of the electromagnetic field F, and hence second 
derivatives of the potential A. The characteristic determinant of the 
Euler—entropy—electromagnetic system is the same as in the case of zero 
conductivity, but the characteristic matrix is no longer diagonalizable; 
the system is not hyperbolic in the sense defined before. 

Fluids with finite, non-zero, conductivity are a physical example*® 
of Leray—Ohya hyperbolic systems (which the authors called ‘non- 
strictly hyperbolic’). Such systems have solutions in Gevrey classes, 
spaces of C° functions whose derivatives satisfy inequalities weaker 
than the inequalities satisfies by the derivatives of analytic functions 
that ensure the convergence of their Taylor series. These Gevrey classes 
enjoy the property fundamental in Relativity of being able to possess a 
compact support without being identically zero. This permits the limi- 
tation by the speed of light of signal solutions of Leray—Ohya hyperbolic 
systems. 

It can be proved that if Hp > 1, then the reduced Einstein-Maxwell- 
Euler—entropy system of a charged perfect fluid with finite, non-zero, 


conductivity is Leray—Ohya hyperbolic (and causal) but not hyperbolic 
(in the strict sense).°” 


IX.14 Magnetohydrodynamics 


In the case where the electric conductivity is so large that it can be con- 
sidered as infinite, the proper-frame electric field Ea = uf Fag becomes 
negligible. The case of a zero electric field is called magnetohydrodynam- 
ics.38 It plays a fundamental role in plasma physics. 


IX.14.1 Equations 


In three space dimensions, for a fluid with infinite conductivity 
o = œ, the electromagnetic field reduces, in the local frame defined 
by the fluid velocity, to the magnetic vector H. The second Maxwell 
equation ôF = J is replaced by 


E® := ug FP = 0. (IX.14.1) 
From the definition of the vector H (orthogonal to u), 
H® = ug(F*)°*, (IX.14.2) 
one deduces 
(F*)8* = Hu? — FP ye, (IX.14.3) 


A straightforward calculation gives the following expression for the 
Maxwell tensor: 


1 
Tag = (taus + T |H|? — Ha H6. (IX.14.4) 
The first Maxwell equation dF = 0 becomes 
Va(F*)®* = Vo(H*u® — HP u?) = 0. (IX.14.5) 


Modulo these equations, the divergence of 7 is found to be 


oa cil 
Var” = (uu? T Ta a| H|? T |H]? Va(u"uf) = Vala"). 
(IX.14.6) 


Straightforward computation shows that it is a vector orthogonal both 
to u and to H. 

The Lorentz force Vat? being orthogonal to u, the continuity equa- 
tion for a fluid of infinite conductivity is the same as for an uncharged 
fluid, namely 


Vol(ut p)u®] — u" dap = 0. (IX.14.7) 
The equations of motion read 
(ut p)utV que + (9%? + uPu%)Oap + Vat? = 0. (IX.14.8) 
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37K. O. Friedrichs has obtained for a 
charged perfect fluid with non-zero fi- 
nite conductivity a symmetric hyper- 
bolic system, but by modifying the 
equations in a way that is not physic- 
ally justified. 


38Equations for magnetohydrodynam- 
ics in Special Relativity, keeping terms 
neglected in classical magnetohydro- 
dynamics, were obtained by Hoffman 
and Teller (1950). In General Relativ- 
ity, they were first written down and 
studied by Bruhat (Choquet-Bruhat) 
(1960). 
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IX.14.2 Wave fronts 


A straightforward computation shows that the characteristic polynomial 
of the first-order system (14.5), (14.7), (14.8) factorizes as 


[(u+ pla? — b}? D, with a:=u%X,, b:= H°X,,  (IX.14.9) 
and with D a fourth-order polynomial in X: 
D = (m, — 1)(u + p)a* + [(u + pt |Al?u,)a? — b’ X“Xa. (IX.14.10) 


The fourth-order cone D = 0 is called the magneto-acoustic cone. 
The second-order cone, which enters as a double factor in (14.9), 


(u+ p)a? — b? = 0, (IX.14.11) 


is called the Alfvén cone. It is composed of two hyperplanes 


[(u + p)2u* + H°]Xa =0. (IX.14.12) 


The corresponding wave fronts, tangent to the dual of a Alfvén plane, 
are called Alfvén waves. 

In the proper frame of the fluid, the normals v to the Alfvén waves 
satisfy the equations 


(u + p)?v + Hiv; =0, (IX.14.13) 


The propagation speed of the Alfvén waves is calculated using the 
general formula 


a 2 
Ve 3 c] (IX.14.14) 
uul + geL Vang 
One finds that 
|Vai] = (u + p)~? Hol, (IX.14.15) 


where H, is the scalar product of H (a space vector) with the projection 
on space (normed to 1) of the normal v to the wave front. 

The wave fronts whose normals lie on the magnetoacoustic cone D = 0 
are called magnetoacoustic wave fronts. The equation of the magne- 
toacoustic cone is a polynomial of order 4 in v, the normal to a wave 
front, which reads, in a proper rest frame of the fluid, 


mp(u + p+ H?)(io)* — (u + p+ |H’ u, + HZ) + HZ =0, (IX.14.16) 


where we have denoted 

- v zi 
We deduce from this formula, solved for the unknown i, the two mag- 
netoacoustic wave speeds. Both are well defined, because |H;| < |H], 
and less than 1, the speed of light, if ju, > 1. They are 


[Vita 


1 
_ p+p+|H ‘up + H3 |u + p+ |H? u, + AZ)? — 40g u,(u+ p+ H?) 


1X.14.18 
2u,(u+ p+ H?) 


The rapid wave speed corresponds to the plus sign and the slow wave 
speed to the minus sign. 

Study of the characteristic determinant shows®® that the system is a 
Leray—Ohya-type hyperbolic system—the fact that Alfvén wave fronts 
are tangent to the magnetosonic wave fronts does not permit one to 
conclude its (strict) Leray hyperbolicity. However, the system has been 
proved to be symmetrizable hyperbolic*’, both by the general method of 
Lax—Boillat and Ruggeri, and directly (at least in special Relativity) by 
K.O. Friedrichs. This hyperbolicity could be foreseen physically by the 
existence of shock waves*! and of high-frequency waves‘? propagating 
according to first-order differential equations. 


IX.15 Yang-Mills fluids (quark—gluon 
plasmas) 


Although Yang-Mills charges are not manifest at ordinary scales, plas- 
mas of quarks and gluons exist under extreme circumstances.*? Their 
properties have analogies, but also differences, with those of electro- 
magnetic plasmas. The equations look formally the same. The Maxwell 
stress-energy tensor is replaced by the Yang-Mills stress—energy tensor 
and the electric current by the Lie-algebra-valued Yang-Mills current 


J% = ut + o Fug, 


where y is a function on spacetime taking its values in the same 
Lie algebra as the Yang-Mills field F and ø is a number, the fluid 
conductivity. 

The mathematical properties of Yang-Mills fluids are“* quite simi- 
lar to those of electrically charged fluids, at least locally, in the case of 
finite conductivity. 

In the case of infinite conductivity, however, a remarkable property 
occurs: the wave fronts do not split into analogies of Alfvén waves and 
acoustic waves, but are at each point tangent to an undecomposable 
sixth-order cone.*° 


IX.16 Viscous fluids 


There have been many discussions of equations for relativistic viscous 
fluids and a relativistic heat equation. Most proposals for such equations 
were motivated by the desire to obtain a causal theory, that is, one de- 
scribed by partial differential equations of hyperbolic type, with a causal 
domain of dependence, whereas the usual Fourier law of heat transfer 
as well as the viscous fluid Navier-Stokes equations of Newtonian mech- 
anics are of parabolic type, corresponding to an infinite propagation 
speed. We will see in Chapter X some results for the approximation 
beyond perfect fluids obtained in the framework of Relativity. 
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39See Choquet-Bruhat (1966). 


40See Anile (1989). 


41General relativistic equations were 
given by Bruhat (Choquet-Bruhat) 
(1960) and were studied in depth by 
Lichnerowicz (1967). 


42 Anile and Greco (1978). 


43 Extremely briefly in Earth-based la- 
boratories. Astrophysicists have conjec- 
tured that they are present in the cores 
of neutron stars. 


44See Choquet-Bruhat (1992). 


45 Choquet-Bruhat (1992b, 1993). 
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46Landau and Lifshitz (1987) link u 
with some density of energy, choosing u 
such that the stress tensor is orthogonal 
to u. 


IX.16.1 Generalized Navier-Stokes equations 


The Einstein equivalence principle would give us an expression for the 
stress—energy tensor of a viscous fluid in General Relativity if we knew of 
such as expression in Special Relativity. However, this is not the case— 
there is no general consensus about such a tensor in Special Relativity, 
everyone being influenced in his choice by his own background. The dif- 
ference between various authors begins already at the physical definition 
of the flow vector*® u. We will stick to the simplest definition, consider- 
ing u as the flux density vector for the particle number r. This quantity 
satisfies the conservation law 


Va(ru%) =0. (IX.16.1) 


It was remarked previously that the components T°°, T”, T'I of the 
stress-energy tensor T in a proper frame represented respectively the 
energy, momentum, and stresses of the fluid with respect to this frame. 
For perfect fluids, the expressions for these quantities were taken as their 
classical analogues, which themselves result from first principles. 

In non-relativistic mechanics, the stress tensor of a viscous fluid is 
obtained by considering it as a linearized perturbation ø of the perfect 
fluid stress tensor, a perturbation due to the deformation tensor of the 
flow lines. In inertial coordinates of absolute space, this deformation 
tensor is 

Dy = Ovi re ot 

Oxi Ox" 

One writes the classical Navier—Stokes equations for viscous fluids by 
introducing the expansion and the shear of the flow lines (in Euclidean 
space E’). The contribution of rotational flow is discarded, on the basis 
that it corresponds to rigid motions, which do not generate stresses. The 
Newtonian stress tensor of a viscous fluid is then written as the 
sum of the stress tensor pd;; of a perfect fluid and a symmetric 2-tensor 
given by 


(IX.16.2) 


k k 
The scalar coefficients, in general considered as constants, are called the 
bulk viscosity and the shear viscosity. 

The extension of the tensor ø to Relativity is ambiguous, owing on 
the one hand to the absence of absolute space and on the other hand 
to the equivalence of mass with energy, and hence with the work of the 
friction forces due to viscosity. The following tensor has been proposed 
as a relativistic generalization of the classical one: 


Tap = (H + p)tatia + PIJap + Cap, (IX.16.4) 


with o the viscosity stress tensor, linear in the first derivatives of u 
and orthogonal to u, 


Tag = ATapV pu? + vabTg(V pts + Vote), (IX.16.5) 


where 7 is the projection tensor*” 


Tab := Jap + Uap. (IX.16.6) 
Some authors add to T a ‘momentum term’ 
UadB ag UBda, (IX.16.7) 


supposed to represent a heat flow resulting from friction. The problem is 
then to write an equation to determine q and justify the symmetrization 
in (16.7). 

The equations for VT? deduced from (16.4) and (16.5) are rather 
complicated. They have been studied in detail by G. Pichon*® in the case 
when Vau“ = 0, called an incompressible fluid. He has shown that for 
large enough u + p, they are of parabolic type, corresponding therefore 
to infinite propagation speed. 


IX.16.2 A Leray—Ohya hyperbolic system for viscous 
fluids 


We have seen that the vector that leads to the generalization of classical 
properties of irrotational motion, for perfect fluids, is the dynamical 
velocity C := h u, with h the enthalpy (see Section IX.9.3). We define 
for this vector C the shear, the vorticity tensor, and the expansion of the 
congruence of its trajectories (the flow lines of the fluid) by the usual 
decomposition. In the case n + 1 = 4, 


1 il 
Ve = Qo t La t (S) aß,» 
B = 3 ap B 4 Jap 
with Q the vorticity tensor, 
Qag i= VpCs — VpCo- 


In the case of a perfect fluid, Q is orthogonal to C (i.e. to u). The scalar 
© is the expansion of the congruence defined by the vector field C: 


O := VaC™. 


We have seen that © is zero if and only if the fluid is incompressible (in 
the relativistic sense). The symmetric tensor © is the shear, which has 
zero trace: 


1 1 
Zag := 9 (Vals + V pCa) = 9enVxC™. 
We propose to take as stress-energy tensor of a viscous fluid the tensor 
Tag = (u + p)tatig + Gasp + Tap, (IX.16.8) 


where T, a8 is the part of T due to viscosity, a perturbation of the perfect 
fluid stress-energy tensor, linear in VC, of the form 


Tag = AO Gap + Vas, (IX.16.9) 
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47Note that the coefficients A and v do 
not have quite the same interpretation 
as in Newtonian mechanics. 


48Pichon (1965). 
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49A singular perturbation with higher 
derivative has been proposed by Ver- 
notte to give an account of the few 
moments occurring before the steady 
state is attained. Vernotte’s idea has 
been extended to General Relativity by 
Cattaneo. 


with A and v viscosity coefficients depending on the particular fluid 
under consideration. The conservation laws Va T®® = 0 are then second- 
order equations for the dynamical velocity C with principal part 


1 1 
MGV VC Hy xe" ral Ohad ee Taa 


1 
= 5(VaV70" + VaVfC®) + Q — 7”) Vela. 


The 4 x 4 characteristic matrix has elements 


“Xa X“? +aXaX’C*, a:= = +A. 


To compute the characteristic determinant, we choose a frame where 
X? = X?’ = 0. The matrix then reads 


Wa + aXyX° aXı X? 0 0 
aXoX! Pa +aXıX! 0 0 
0 0 Y xax, 0 
2 V 
0 0 0 5X*Xa 
(IX.16.10) 


The characteristic determinant is then 


A(X) = P(X) (5 + a) (X° Xa)?, 


P(X) := (SX°Xa Ee aXoX°) (ZX Xa Es aX: X") — a2X1X1X°X, 


V va 2 V va ay— Y/Y a 2 
P(X) := (5x Xa) + £X*Xq(aXaX%) = 5 (5 +2) (X° Xa). 
The proposed system is Leray-Ohya hyperbolic (Gevrey class of index 
2) and causal, a satisfactory property for a relativistic theory. 

It can be conjectured that a solution of the proposed relativistic 
Navier-Stokes equations converges to a solution of the perfect fluid 
equations when à and v tend to zero. 


IX.17 The heat equation 


We personally think that heat transfer is a collective effect; the Fourier 
equation corresponds in fact to some asymptotic steady state.*? It is 
not surprising that it does not translate readily into relativistic causal 
equations. The classical heat equation appears after the study of the 
Brownian motion, unfortunately the extension to Relativity of the study 
of Brownian motion is still in its infancy, owing to several difficulties, in 
particular the lack of absolute time. 


The Muller—-Ruggeri extended thermodynamics theory offers a way to 
treat dissipative phenomena in a relativistic context (see Chapter X). 


IX.18 Conclusion 


The relativistic theory of perfect fluids is mathematically and physic- 
ally? satisfactory. With proper definitions, the theorems of classical 
(i.e. non-relativistic) perfect fluid theory are valid. Global mathemat- 
ical problems remain mostly open, but this is also the case for classical 
fluids in spite of recent progress. A book by Christodoulou?! treats in 
depth three-dimensional perfect fluids in special Relativity up to shock 
formation. 


IX.19 Solution of Exercise IX.6.2 


Light cone: The light rays issuing from a given point satisfy the 
equations 

dx 3 : 

ar 9°? pg, 9°’ Pape = 0. (IX.19.1) 
Hence, with the usual identification of the tangent and cotangent space 
to V through the metric g, 


Jagu? =0, w= 9° pp. 


IX.20 Problems 


IX.20.1 Specific volume 
Show by integrating the equation 
Va(ru*) =0 
and using the Stokes formula that the t-dependent space scalar 


ru? N (IX.20.1) 


r 


represents the density of flow lines crossing a spacelike submanifold, 
where N denotes the spacetime lapse. 


IX.20.2 Motion of isolated bodies 


Denote by (V,g, r, u) the manifold, spacetime metric, and perfect fluid 
flow solution of the Einstein equation with perfect fluid source with 
initial data (g, K,0,7) on a manifold M satisfying the hypothesis of 


IX.20 Problems 235 


50In spite of problems with the choice 
of an equation of state for the fluid, 
particularly when the whole cosmos is 
concerned, although it is unlikely that 
its content can be modelled by a perfect 
fluid. 


51 Christodoulou (2007). 
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the existence and uniqueness theorem. Assume that the support of the 
initial density 7 is the closure © of some open subset w C M, which has 
several disconnected components, spaces occupied by material bodies at 
the initial time. 

Show that the solution does not depend on the ‘unphysical’ data of 0 
where 7 vanishes. 

Extend the property to charged dust. 


Solution 


Equations (2.4) and (2.3) show that the support of r is contained in 
the geodesic tube Q generated by geodesics of g tangent to w issuing 
from points in œw. Let vı be another initial value for u, such that 0; = v 
on w. The geodesic tube Qı of g corresponding to 0, coincides with Q; 
therefore, u; = u in Q. Since r = 0 outside Q, the dust stress—energy 
tensors ru & u and ru, ® uy with u1 = u in Q coincide on the whole of 
V; therefore, the triple (g, r, u1) satisfies the Einstein dust system on V. 
By the uniqueness theorem, it coincides, up to isometry, with another 
solution (g1,71, u1) taking the same initial values, i.e. gı = g, rı =r on 
V and u = u in the domain Qı = Q of the presence of a fluid flow. 

The theorem on the motion of isolated bodies extends to the case of 
charged dust if we reasonably make g/7 = 0 outside the support of 7 
because the flow lines of u depend only on g and F, while the right-hand 
sides of (3.10) vanish outside the support of 7. 


IX.20.3 Euler equations for the dynamic velocity 
Recall that 


C°Cy = —f?, hence C°VgC, = —fdgf. 


Write the equations of motion and the energy equation of a perfect fluid 
in terms of the dynamic velocity. 


Solution 


We deduce from the definition of the index f that 


Ologf 1 Bencė Op Op u+p 
Op wtp Ologf "7P aft 2P 
dp __ Op dlogf (AP) _ 9, ytd 
Of? Oloef Of \af) — “*? Faz 
Hence 
apa l ores ines, S22 =f Onf =—f "CP V aC3. 
of? +p 


Recall that the Euler equations of motion are 


ap 
“Yy l ted l a = 
u° Vaug + (93 + wus) oP 
We have 
u“Vaug = f 10°V al fC) = J (C VaCe — f° CCa f0.f) 
(IX.20.2) 
and 
a a ap = a a —2 — f-2 : —2 rma 
(gg+u aor = (ggtuue)f “fOaf = f“(foeft+f “C°Cg fodaf), 
(IX.20.3) 


which reads 


ap =) 
urV oug + (93 +u%u = C°VaCg — C°V gC), 
B + (93 ET T B pCa) 


and hence, for the equations of motion, 


C°Qag = 0. (IX.20.4) 


The energy (also called continuity) equation for a barotropic fluid 
reads 


oap du 
Vau” tur = 0, hss —, TX.20.5 
u u Ve pap Ly dp ( ) 
We have 
Oa E 2 
Vau” + u“ u, P = Valf 1C) + f sO uT "FoF 
f+ p 


=f [VaC* + C°f-?(-1+ 4)] - 
Hence, for the energy equation, 
VaC®| + f-7(-1+ up)C* 3af = 0. 
This reduces to VaC° = 0 if jz, = 1 (incompressible fluids). 


IX.20.4 Hyperbolic Leray system for the dynamical 
velocity 


1. Prove that the equations satisfied by the dynamical velocity C, 
C°Qag =0 with Qag := Vals — VeCa 
and 
Val = (uy = Doge 
imply, when 62 is known, that C satisfies a quasidiagonal system of 
second-order differential equations that is hyperbolic if du/dp > 0. 


CoV Ca, 


2. Show that C satisfies a quasilinear quasidiagonal third-order system, 
which is hyperbolic and causal if du/dp > 1. 
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Solution 


1. For C considered as a 1-form, we have the identity (see Chapter I) 


{(d6 + 6d)C}, = —g VaVgCy + RSCq. (IX.20.6) 


The definitions of the rotational and the energy equations give 


co 
dC=Q and 6C:=—VaC* = —(u, — VG 


CPC. 


(IX.20.7) 
It results from (20.6) and (20.7) that 


a Oe ce a 
gV aV pCa — BAC, = <7, @ “Dang 0 a.Cs| dP Oa: 
(IX.20.8) 


We compute 
Vy os, - Dae GPT «C| = Alp — 1) + Bô, (u, — 1), 
where we have set 
A:=V (yaC ) , and B:= O 9 : 

Y CO, B A B 

It holds that 
b B 

= Oro, Veng + (v "OO, ne) (VaCsg). 
By the antisymmetry of the Riemann tensor, we have 
C OPV yV aC = O70? (VaV yCp + Ryaipp CP) = C°C?VaV4Ce. 
By the definition of the vorticity Q, 

O° CPV Veg = CCV (Qua + VaCy) 
= O° Oo VaVpCy + CP CT ays; 

with, by the Helmholtz equation, 


CPV ap = UaV HC? + Nay Vg. (IX.20.9) 
Therefore, 
1 
= an B a a 
A = GG [C°CPVaV Cy + C7 (Opa V0% + Ray VC] 


We deduce from these computations, since Q is of first order in 
C, that when V°Q,, is known, the dynamical velocity C satis- 
fies a system of quasidiagonal second-order equations with principal 
operator 


(0,00) := o” + (un — 1) (S) | vac. (IX.20.10) 


The first-order terms in C are 


One? ma Gace 
Dy := (u — 1) |- onn Val" + (v ac) (VCs) 


In a proper frame, the corresponding characteristic polynomial is 
= u, X +X (Xi). (IX.20.11) 
It is hyperbolic if u, > 0, causal if 4, > 1. It reduces to the Minkowski 
polynomial for w, = 1. 
2. Since V“Qag is second-order in C, the principal part of the full oper- 
ator does not reduce to the operator (20.10). To obtain a hyperbolic 


operator for C, we take the derivative of (20.4) in the direction of C. 
We have 


C`VAV aNg = V (CAV Mag) — (VCA) V Nag. 


Hence, modulo the Helmholtz equations, C*V,V,2° g is a quadratic 
polynomial in VC and V?C, namely 


Eg := OV AV Os 


IX.20.12 
=V ° (Ng, Val* + Qa VC’) — (V°C*) Vy Qag. l ) 


We have proved that C®(VaOc,gC + 6Q) is a quasidiagonal third- 
order system with characteristic polynomial in a proper frame 


FXo |-u,X8 + (Xi)? . 


The lower second-order terms in C are 


OY aly + Ez. 


IX.20.5 Geodesics of conformal metric 


Show that the trajectories of the dynamical velocity C := fu are 
geodesics of the metric g := {2g conformal to the original spacetime 
metric g. 


Solution 
If 9 = f7g, it holds that 
V pCa — VpCa = (Mea — Aa) Ca 


= — f7? (OafCg + OgfCa— Jagg” OufCr) , 
(IX.20.13) 


and hence, using (9.5) and (9.4), 
CPV pCa = C8Valg — fa fC’ Cg = 0. (IX.20.14) 
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IX.20.6 Cosmological equation of state p = (y — 1)u 


Take the cosmological equation of state p = (y — l)u. 
1. Compute the index f. 


2. Compute the energy equation. 


Solution 


1. If p= (y—1)p, then the index f is given by 


jeon| fe), 


and hence, up to an irrelevant multiplicative constant, 
f= pOTD/A, p= fOD, 
2. If p= (y — 1)u, then 


(IX.20.15) 


Relativistic kinetic theory 


X.1 Introduction 


In kinetic theory, the matter, usually called a plasma, is composed of 
a collection of particles whose size is negligible at the considered scale: 
rarefied gases in the laboratory, stars in galaxies, or even clusters of 
galaxies at the cosmological scale. The number of particles is so great and 
their motion so chaotic that it is impossible to observe their individual 
motion. 

We have introduced in Chapter IX equations for perfect fluids in Gen- 
eral Relativity as a direct consequence of the perfect fluid equations in 
Special Relativity and the equivalence principle. We have seen that they 
are quite satisfactory mathematically as well as physically. We have also 
seen that there are no compelling macroscopic considerations leading to 
satisfactory relativistic equations for dissipative fluids, even in Special 
Relativity. The classical Navier-Stokes equations lead, like the Fourier 
law of heat transfer, to equations of parabolic type, and hence to an infin- 
ite propagation speed of signals, which is incompatible with relativistic 
causality. The formal generalizations of these equations with the aim of 
obtaining hyperbolic equations lack general justification and there is no 
consensus about them. 

An a posteriori justification of the non-relativistic Euler and Navier— 
Stokes macroscopic equations is to deduce them from the motion of the 
fluid particles at the microscopic scale and the statistical hypothesis of 
classical kinetic theory. It presents no conceptual difficulty to extend 
the setting of kinetic theory to Special as well as to General Relativity. 
It is straightforward to construct an energy-momentum vector and a 
stress-energy tensor for kinetic matter and to couple the latter with 
the Einstein equations. The collective motion of collisionless particles is 
naturally modelled and leads to conservation equations for the stress— 
energy tensor. A case of particular interest in General Relativity is when 
the ‘particles’ are stars in galaxies, or even galaxies in clusters of galaxies. 
It is then appropriate to take the charges to be zero and the masses to 
have values! between two positive numbers. 

When the particles undergo collisions, one can write a Boltzmann 
equation, and a coherent Einstein—Boltzmann system, for an appropri- 
ate choice of the collision cross-section. The problem is the choice of a 
physically reasonable cross-section; this problem arises already in the 
non-relativistic case, but is especially delicate when it is stars or galax- 
ies that collide. The Boltzmann equation arises in Special Relativity for 
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1And not discrete values, while this is 
the case for electromagnetic plasmas. 
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?Representation by a many-particle 
distribution function is possible, but 
difficult to handle. Representation by 
a one-particle distribution function is 
linked with the onset of chaos. The 
mathematical justification of the onset 
of chaos in relativistic dynamics is a 
largely open problem. 


3In local coordinates, g(p,p) = 
Japp“p’. 


4In the sense of the Gibbs ensemble. 


plasmas of elementary particles with a finite number of distinct proper 
masses and charges; the gravitational field is then negligible under usual 
circumstances. However, the general relativistic Boltzmann equation is 
possibly important for obtaining equations for dissipative relativistic 
fluids as approximations. 

Throughout this chapter, as in previous chapters, we assume that the 
spacetime (V,g) is an oriented and time-oriented Lorentzian manifold, 
the tangent bundle Ty is oriented by the orientation of V, and all fibres 
are positively oriented. We treat the (n+1)-dimensional case, specifying 
n = 3 when it enjoys special properties. 


X.2 Distribution function 


X.2.1 Definition 


It is assumed that the state of the matter in a spacetime (V,g) is 
represented? by a ‘one-particle distribution function’. This distribution 
function f is interpreted as the density of particles at a point x € V 
that have a momentum p € TV, the tangent space to V at x. Ina 
relativistic theory, the momentum p is a future timelike or null vector 
whose components define the energy and momentum of the particle with 
respect to a Lorentzian frame. With our signature conventions, it holds 
that? g(p,p) = —m?, with m the rest mass of the particle. In view of 
applications to astrophysics, we do not assume a priori that it is the 
same for all particles. We state a definition: 


Definition X.2.1 A distribution function f is a non-negative sca- 
lar function on the so-called phase space Py, a subbundle of the tangent 
bundle Ty to the spacetime V: 


f:Py—>R by (a,p)% f(a,p), with ceV, pEeEP, CT,V. 
(X.2.1) 


If (V,g) is a Lorentzian manifold, then the fibre Pa at x is such that 
Jx(p,p) < 0 and, in a time-oriented frame, p? > 0. 


Remark X.2.1 Jf a particle has positive rest mass, then it holds that 
p? > 0 in a time-oriented frame. 

If a particle has zero rest mass, then p? can vanish only if p itself 
vanishes. The value p = 0 is, from the mathematical point of view, a 
singular point of the vector field X. On the other hand, relativistic phys- 
ics says that particles with zero mass move with the speed of light, the 
component p? in an orthonormal frame is the energy with respect to an 
observer at rest in this frame, and quantum theory tells us that it does 
not vanish, i.e. p? > 0. 


X.2.2 Interpretation 


The physical meaning of the distribution function is that it gives a mean* 


‘presence number’ density of particles in phase space. More precisely, 


we denote by 0 the 2(n + 1)-volume form on TV, i.e. with 6, and 6, 
respectively the volume forms on V and T;V, 


0 := Oz A Op. (X.2.2) 
In local coordinates, 6, and 6, are given by 


Oz = |Det g|? dx? ^dz!... Adz”, Op := |Det g|? dp? A dp! ... A dp”. 
(X.2.3) 


X.2.3 Moments of the distribution function 


The moments of f are functions or tensors on V obtained by integration 
on the fibres of the phase space Py of products of f by tensor products 
of p with itself. 


Moment of order zero 


This is by definition the integral on the fibre P, of the distribution 
function:® 


jis | fp- (X.2.4) 
Pa 
It is a ‘density of presence’ in spacetime. 
First and second moments 


The first moment of f is a vector field on V defined by 


P(t) =| p“ f (x, p)Op- (X.2.5) 
If the spacetime is time-oriented and the particles have non-negative rest 
mass and travel towards the future, i.e. if p% is future causal (timelike 
or null), then the fibre P,, is included in the subset p? > 0. The vector 
fp® is then also causal and future-directed, since f > 0, and the same 
is true of the first moment P%. 
Out of the first moment, one extracts a scalar r > 0 interpreted as 
the square of a specific rest mass density given by 


2 = —P"P,. (X.2.6) 


If P is timelike, then r > 0 and one deduces from the first moment a 
unit vector U interpreted as the macroscopic flow velocity and given by 


CF ea Pe, (X.2.7) 


A sufficient condition for P to be timelike is that the particles have 
positive mass, since then all p, and hence also P, are timelike. 

The second moment of the distribution function f is the symmetric 
2-tensor on spacetime given by 


TH (x i= fs f(x, p)p~p"0 (X.2.8) 


It is interpreted as the stress—energy tensor of the distribution f. 
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>Recall that the integral on an oriented 
(n + 1)-manifold Pz, with orientation 
defined by the order of coordinates 
v,...,2”, of the exterior differential 
form f(p)@p is equal to the calculus 


integral 


[ FP) = Í, EOL 
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Theorem X.2.1 Ifa distribution function f depends on p only through 
a scalar product Uap“, with U% a given timelike vector on spacetime, 
then the first and second moments of f read respectively as the momen- 
tum vector and the stress—energy tensor of a perfect fluid. The unit flow 
velocity is collinear with U. 


Proof. Assume that 


f(x, p) = F(a, Uo(x)p*). (X.2.9) 
The first moment of f is then 
P*(a) =| p° F(a, Ua(x)p%) Op. (X.2.10) 
Pe 


Take at x an orthonormal Lorentzian frame with time axis collinear with 
U. In this frame, Ua (x) = —A(x)6°, with A := (-U,U%)2, and hence we 
have the following: 


1. The first moment of f has components at x such that 


P*(2) =i p’F(x,—A(x)p?)0p = 0, (X.2.11) 
Px 
because the integrand is antisymmetric in the pê while F is positive. 
The vector P is therefore collinear with the time axis. The component 
P? in the orthonormal frame, 


P(x) =f p?P(x,-N(0)p")Op, (X.2.12) 
Px 
is the macroscopic rest mass density (the presence number density if 
all particles have the same mass normalized to 1). 


2. In the same frame, the same antisymmetry considerations imply that 
T” = 0 and T% = 0 for i Æ j. The equality of the components 
TË i=1,...,n, 


= | (p')? F(x, —A(x)p") Op, (X.2.13) 
Pa 
results from the invariance under rotations in momentum space of 
the function F and the volume element 6,. The component T° is the 
positive function 


pos =] (p°)? F(x, —A(x)p°) Op. (X.2.14) 
Px 
The tensor T is therefore the stress-energy tensor of a perfect fluid 
with flow vector u = \~1U, specific energy given by (2.17), and 
pressure computed from 2.13. 


Remark X.2.2 The physical interpretation of the first moment that 
6Marle (1969). we have given coincides with the interpretation chosen by Marl and 
TEckart (1940). with Eckart’s original definition.’ For a discussion and interpretation 


of the second moment in the general case (dissipative fluids), see the 
discussion in Marle’s paper on pp. 137-148. 


Higher moments 


Higher moments are defined as totally symmetric tensors on V 
given by 


arap f f(a, p)p™ -pP Op. (X.2.15) 
Pr 


They play an important role in the Müller-Ruggeri extended thermo- 
dynamics. 


X.2.4 Particles of a given rest mass 


Stars or galaxies do not have the same rest mass, but the original kinetic 
theory was constructed for the case of gases in laboratories, composed 
of molecules with the same rest mass m. In that case, the phase space 
over (V, g), denoted by Pm,v, has for fibre Pm,» the mass hyperboloid 
(also called the mass shell) 


In the case of particles of a given mass m, the volume form 6,,,, on the 


mass shell Pm,» is, taking the př as local coordinates on Pm,» (then p? 
is a function of x and p’), 


1 
Sp haan (X.2.17) 
Exercise X.2.1 Prove this formula using the Leray definition 


a| 5(o(0.0) -m| A Bn = Op (X.2.18) 


The moments can be defined by the same formula as in Section X.2.3, 
by integration on the mass shell and replacing 0, by Om,p. 

One can also consider the case of N different particles with different 
masses mz, I = 1,2,...,N; there are then N different phase spaces 
Pm,,v, and different distribution functions fr. The moments to consider 
are the sums of the corresponding moments; for example, the second 
moment is 


TP (x op (£, p)p*p" bm; p- (X.2.19) 


For details see the article by Marle.® 


X.2 Distribution function 


8Marle (1969). 
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X.3 Vlasov equations 


In the Vlasov models, one assumes that the gas is so rarefied that the 
particle trajectories do not scatter (i.e. one neglects collisions between 
particles). Their motion is determined by the average fields they generate 
on spacetime. 


X.3.1 General relativistic (GR)—Vlasov equation 


In a curved spacetime of General Relativity, in the absence of non- 
gravitational forces, each particle follows a geodesic of the spacetime 
metric g. We have seen (Chapter I, Section I.8) that the differential 
system satisfied by a geodesic in the tangent space TV of a pseudo- 
Riemannian manifold (V,g) reads in local coordinates, with A called a 
canonical affine parameter, 
a Ex 

p” i= i w =G*, with G*:= -TX pòp", (X.3.1) 
where I'S, are the Christoffel symbols of the metric g. In other words, 
the trajectory of a particle in TV is an element of the geodesic flow 
generated by the vector field X = (p,G) whose components p*,G® in 
a local trivialization of TV over the domain of a chart of V are given 
by (3.1). 


Exercise X.3.1 Show by direct calculation that X = (p,G) is indeed 
a vector field on the tangent bundle TV. 


In a collisionless model, the physical law of conservation of particles, 
together with the invariance of the volume form in TV under the geo- 
desic flow (Liouville’s theorem; see Problem X.14.1) imposes that the 
distribution function f be constant under this flow, that is, that it sat- 
isfy the following first-order linear differential equation, which we call 
the GR—Vlasov equation: 


Lx f =p rg T TYP pe = 0. (X.3.2) 


We have interpreted the scalar —g(p,p) as the square of the rest 
mass of the particle. The following lemma, the formulation of a prop- 
erty already seen in other contexts, makes this interpretation physically 
consistent: 


Lemma X.3.1 In a GR-Vlasov plasma, the scalar g(p,p) is constant 
under the geodesic flow; that is, 


o , að ) Q Q 
Lx{g(p,p)} := p“ i -G sew =0, GĦ := -T$ pp". 


Exercise X.3.2 Prove this identity. 
Hint: Use the values of the Christoffel symbols. 


For particles with a given rest mass, it is convenient to take as coord- 
inates on the phase space Pm the 2n+1 numbers z“, p’. The distribution 
function fm in such coordinates is 


Fle) = F, p, p° (p")). (X.3.3) 


Exercise X.3.3 Prove that gagp*p® = —m? implies 


Op? py” xr% 2g Oa" 


3p? i 3p? ApH ð 

P Pi P pP P Dru (X.3.4) 
Deduce from this definition and identities that the Vlasov equation takes 
the form, with bounded coefficients,’ 


Ofm I p o fm fi G Ofr pe 
0x9 ` p0 Axi ` p? Api 


X.3.2  EM-—GR-Vlasov equation 


When, in addition to gravitation, the particles are subjected to some 
given force, represented by a vector ® tangent to the spacetime V, the 
vector Y tangent to the particle trajectories in the phase space Py over 
a spacetime (V, g) is 


Y=(p,G+6), ie. (¥4) = (p%, G? + 6%). (X.3.5) 
If the volume form @ is invariant under the flow of Y, that is, if 


Ly0 = 0, then particle number conservation gives for the distribution 
function f a generalized GR-Vlasov equation 


a) a) 
Lyf= prot +(- or + wet =0. (X.3.6) 


Lemma X.3.2 Lyd =0 if O®%/Op* = 0. 
Exercise X.3.4 Prove this result. 
Hint: Use the identity Ly 0 = d(iy@) + iy (d0). 
If the particles have electric charge e and move in an electromagnetic 
field F, then their trajectories have tangent vectors in phase space 
Y = (p°,G* + 8°), with ®* = elg(p, p)? F° pg. (X.3.7) 
We deduce from the expression for ® and the antisymmetry of F that 
OO’? _ 
ðp 


Therefore, the volume form @ is invariant under the flow of Y, and particle 
number conservation gives the EM-GR-Vlasov equation 
of of 


a xr —1 paps = 
age t (Dir pt elg(p, A e i)a = 0. (X38) 


Lyf =p" 


X.3 Vlasov equations 


°Choquet-Bruhat (1971b). 
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10Choquet-Bruhat and Noutchegueme 
(1986). See also YCB-OUP2009, Chap- 
ter X, Section 3.3. 


11A case treated in Marle (1969). 


The following lemma provides a coherent consideration of an EM—GR 
plasma where particles have both the same mass and the same charge. 


Lemma X.3.3 In an EM-GR-Vlasov plasma, the scalar g(p,p) is 
constant along an orbit of Y in TV; that is, 
Og(p, p) 


b (G° + em! Fp) = = 0, 


a 99(P, p) 
p 


Ox 


The proof is an easy consequence of the lemma proved in the purely 
gravitational case, because 


Apk 3 
ög(p p) _ AGrxuP*Ph) _ 2p, and F*"pap, = 0, 
Op Op 


by the antisymmetry of F. 


Exercise X.3.5 Prove that if the particles all have the same mass m, 
then the distribution function fm satisfies on Pm the following reduced 
EM-—GR-— Vlasov equation: 


Ofm _ P Ofm , Git ® Dfm 

ðxl | p? dxt : p? Op? 

The EM-GR-Vlasov equation can be generalized to particles with 
random charges (see Problem X.14.2). 


= 0. 


X.3.3 Yang-Mills plasmas 


Yang-Mills plasmas were not observed in laboratories until high- 
energy particle colliders were able to decompose baryons into their 
constitutive quarks and gluons. The Yang—Mills—Vlasov!? equation is 
analogous to the EM-—Vlasov equation, but the electromagnetic field is 
replaced by a Yang-Mills field taking its values in a Lie algebra G and 
the electric charge e, a constant, is replaced by a function q on the space- 
time V with values in the Lie algebra G. The phase space for the kinetic 
theory is now the product Py x G. Although the gravitational field does 
not seem to play a role at the scale of Yang-Mills plasmas, we write the 
general relativistic equations here. The trajectory of a particle in this 
phase space is a solution of the differential system 


dx” dp dq 

—=p", SS e ag Fe = = Q := —p*|Aa q], (X.3.9 
ag ne. “ae +a: pg, 7 = Q:= —p*[Aa,.a], (X3.9) 

where [.,.] is the bracket in the Lie algebra G, ie. [Aa,q]* := 

cf AP qs. The distribution function f is now a function of x,p,q. In 


the absence of collisions and other forces, f satisfies the Einstein-Yang— 
Mills—Vlasov equation 
o o 
prot + (G* +q- F%pg)—— +Q*— =0. (X.3.10) 
Ox p 
A realistic plasma may contain N charged particles of different 
kinds,'! with rest masses my and electric charges ez; their momenta lie 


XA 


in mass hyperboloids gagp*p? = —m?. If we take the pê as coordinates 
on these mass hyperboloids and denote by fm,(#,p) their distribution 
functions, then each fm, satisfies a reduced EM-GR-Vlasov equation 
on a phase space Pm,- 


X.4 Solution of a Vlasov equation 


X.4.1 Construction 


Let (V, g) be a Lorentzian manifold. Assume given on the tangent spaces 
to V, TV, at points of an initial hypersurface Mp C V, a function f 
that will be the initial value of a distribution function f solution of the 
Einstein—-Vlasov equation, at least in a neighbourhood of Tm, V. 

The GR-Vlasov equation is a linear first-order partial differential 


equation 


Of (x, p) 


Ox® 


Of (x, p) 


—_—.—— = 0. 
Op 


Lx f = p” + G° (x, p) 
As is classical for solving such an equation, we transform it into an 
ordinary differential equation by solving the differential system called 


the characteristic system: 

dx“ P 

—= = = G*% = -TI pp". X.4.1 

BP e a Su PP ( ) 
Let x° = 0 be the equation of Mo. Denote by €’ and m® coordinates of 
a point of € € My and a point m € TeV, the tangent space to V at €. 


Assume given a function f(€,7); then the quasilinear first-order differ- 
ential system (4.1) has, for A small enough, one and only one solution, !? 
a geodesic flow that takes, for A = 0, the given values (£, 7): 


x“ = $“ (A, £, T), p~“ = f(A, 6T), E := (EJ a= (x), 
with 
¢°(0,£,7) =0, H (0, E, T) =E, ¥%(0,6,) = 7°. (X.4.2) 


Inserting these functions ¢° and %®“ into the Vlasov equation gives the 
following ordinary differential equation along the trajectory issuing from 


(7): 


{C YA, S mh 
dà 


that is, f is constant along the trajectory, i.e. 


FOYA, E T)} = FAA V0, €, 7)}. 


The mapping A : (A, £',7%) + (x%, p®) reduces to the identity for 
A = 0; it is therefore invertible!’ (if C1) for small enough A, i.e. in 
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12A local C! solution exists, i.e. a 
mapping (€,n) + («(A),p(A)) for A 
small enough, if the coefficients of the 
differential system are C!, i.e. if the 
spacetime metric is C?. The solution 
is unique if the coefficients are Lips- 
chitzian in x and p, and hence if the 
spacetime metric is C11 (i.e. Ct with 
Lipschitzian derivatives). 


13Tn other words, the geodesic flow has 
no conjugate point. 
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14This global existence holds, in ap- 
propriate functional spaces, for lin- 
ear hyperbolic equations, in particular 
wave equations in a Lorentzian metric 
whose null rays issuing from a point do 
not necessarily constitute (out of this 
point) a smooth hypersurface. 


15 Choquet-Bruhat (1971b). 


a neighbourhood of My. The inverse mapping AT! gives the following 
solution of the GR-Vlasov equation: 


f(x,p) = FEl, p), T(x, p)). (X.4.3) 
The same procedure gives the general solution of an EM-GR-Vlasov 
equation. 


Exercise X.4.1 Write the characteristic system of an EM-GR-Vlasov 
equation. 


X.4.2 Global existence theorem 


The construction given in Section X.4.1 breaks down when the geodesic 
flow ceases to be regular. However, this breakdown does not imply that 
the Cauchy problem for the GR-Vlasov equation does not have a global 
solution.'4 This solution is not given by the formula (4.3), but its global 
existence can be proven by using energy-type estimates for first-order 
linear equations. We state a theorem,!° leaving to the interested reader 
the formulation of relevant functional spaces. 


Theorem X.4.1 The Cauchy problem for the GR-Vlasov equation on 
the phase space Py of a regularly sliced (n + 1)-dimensional spacetime 
(V,g), with data f on Py, M := M x {0}, admits one and only one 
solution f on Py. 


It is probably possible to prove global existence on the tangent bundle 
of a globally hyperbolic manifold with appropriate hypotheses on the 
initial manifold and initial data. 


X.4.3 Stress—energy tensor 


The stress-energy tensor of a distribution f is its second moment: 
Te%(0):= | Fep) pp (X.4.4) 
Po 


It is defined at x € V if fp%p® is integrable on the fibre P, of the phase 
space, which requires a proper fall off of f in p at infinity in P,, satisfied 
in particular if f has compact support in p; this compactness results 
in a neighbourhood of M from the compactness of the support of the 
initial f. 
More generally, consider the case of interest for astrophysics: 
0 < m? < —g(p,p) < M°. 


It holds that p%p*/(p°)? is uniformly bounded on Py, and hence the 
integrals (4.4) are defined if 


| (p°)? f(x, p) Op 
P. 


æ 


is defined. 


X.5 


X.5 The Einstein—Vlasov system 


X.5.1 Equations 


The Einstein—-Vlasov system comprises the Einstein equations for a 
Lorentzian metric g with source the second moment of a distribution 
function f, coupled with the GR-Vlasov equation for f: 


1 
Sap = Tap, with Sag := Rag — 398R, (X.5.1) 


Tag(x) =j f(x, p)Papglp, Op := |Det g|? dp?dp! ...dp", (X.5.2) 
P 


coupled with 


of + GO =0, G”; -T$ pòp". (X.5.3) 
Ox” 


Lxf =p” 


X.5.2 Conservation law 


The Einstein—Vlasov system is coherent if the stress—energy tensor (5.2) 
satisfies the conservation law 
Ver =0. 


We prove the following more general theorem:!° 


Theorem X.5.1 Ifthe distribution f satisfies the GR—Vlasov equation 
Lxf =0, then its moments satisfy the conservation laws 


Va, MN? = 0), (X.5.4) 
where V is the covariant derivative in the spacetime metric g. 


Proof. We choose at the point x coordinates such that the first deriva- 
tives agau := (0/0z%)gx,,, and hence the Christoffel symbols, vanish at 
x. At that point, in these coordinates, it holds that 


o 


[21X2 Op 
Ox™ f 


Va, M% Oe = 


It also holds that (0/0x°)0, = 0p, since (0/0x°) (Det g)? = 0 when 
Oo9ry = 0. Therefore, 


o o 
M102 +p = 
Ox Ox 


0 oan Qp 
=f ggo f (2 P)P -P Op, 


J Fap) ep0 
Pe 


but when the Christoffel symbols are zero, 


ð 
p Ayer f(x, p)p™ ...p°? Oy = | Lx f(z, p)p™ ...p°P Op.  (X.5.5) 
Pz, OF Px 
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16For the first moment, this expresses 
the conservation law of the macroscopic 
momentum. 
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17Choquet-Bruhat (1971b), for par- 
ticles with a given rest mass. For par- 
ticles with rest mass in a range of 
positive numbers, see YCB-OUP2009, 
Chapter X. 


We have obtained a relation between tensors on V at an arbitrary point 
in particular coordinates; in arbitrary coordinates, it reads 


Via MO OP = | Lx f(x,p)p™ ...p°? Op. (X.5.6) 
Pe 


X.6 The Cauchy problem 


In a Cauchy problem, one looks for solutions of a system of differential or 
integrodifferential equations taking given data on an initial hypersurface. 
A local-in-time existence theorem for a solution of the Cauchy problem 
for the coupled Einstein—Vlasov system was proved long ago;!” here we 
will only give the idea of the proof and state the main results. 

As in the general case of Einstein equations with source a stress- 
energy tensor satisfying the conservation laws, the solution of the 
Cauchy problem for the Einstein equations with a kinetic source splits 
into an elliptic system called the constraints for Cauchy data on the 
initial hypersurface and a hyperbolic evolution system for the metric 
in a suitably chosen gauge, for example a wave gauge. This evolution 
system is coupled with an evolution system for the source, here a 
Vlasov equation. The solution in wave gauge is proved, as always, to be 
a solution of the full Einstein equations by using the conservation law 
satisfied by the stress—energy tensor—here it is a distribution function 
solution of the GR-Vlasov equation. 


X.6.1 Cauchy data and constraints 


An initial data set for the Einstein—Vlasov system (see Chapter VIII) 
is a quadruplet (M,9,K,f), where M is a hypersurface of a differen- 
tiable manifold V, g and K are respectively a properly Riemannian 
and a symmetric 2-tensor on M, while f is a function on the bundle 
Pm := UremPx(V) corresponding to the kind of particles we consider. 
As always, the data must satisfy on M the Hamiltonian constraint 


R(g) — |K|} + (trg K)? = 2N?T™ (X.6.1) 
and the momentum constraint 


V.K — Vir K = NT", (X.6.2) 
TP (x) = F(p) Op, «eM, pE P,(V). 
Pz 
A general property of the Einstein equations is that the conformally 


formulated constraints decouple if the initial manifold has constant mean 
extrinsic curvature and the source has zero momentum, i.e. T” = 0. 
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Exercise X.6.1 Show that T(x) =0 if f(a, p®, pt) = f(a, p®, —p’). 


In the Maxwell—Einstein—Vlasov case, the Cauchy data on Mp are, 
in addition to g, K, and f, the data Fag of Fag on Mo. 


Exercise X.6.2 Write the system of constraints satisfied by 9, K, F. 


X.6.2 Evolution 


As is often the case for a coupled system, a general method to solve the 
Einstein equations in wave gauge coupled with a GR-Vlasov equation 
is by iteration: find a solution fı of the Vlasov equation for a given 
Lorentzian metric gi; then look for a solution gə of the reduced Ein- 
stein equations with source f1; then iterate and study the convergence 
of the iteration, either numerically or mathematically. The mathemat- 
ical method used by Choquet-Bruhat!® to prove the existence of these !8Choquet-Bruhat (1971b). 
iterates and their convergence is through classical energy estimates for 
wave equations for the metric and weighted-in-p® energy estimates for 
the Vlasov equation to be satisfied by the function f. It is probably pos- 
sible to use instead constructive methods, relying on the parametrix in 
the Einstein case and the formula (4.3) for the Vlasov part. 


X.6.3 Local existence and uniqueness theorem 


We state the theorem, leaving the formulation of relevant functional 
spaces to the interested reader: 


Theorem X.6.1 The Cauchy problem for the Einstein—Vlasov system 
with initial data (g, K) on M, f, on Pm, satisfying the constraints, ad- 
mits a solution (g, f) on Ve x Py, with Ve := M x [0, 4]), and £ small 
enough. 


X.6.4 Global theorems 


The collisionless kinetic theory having no global problems of its own, 

one may hope to extend to the Einstein—Vlasov system global results 

obtained for the vacuum Einstein equations. Such results are already 

hard to prove. Global existence theorems, or proofs of the cosmic cen- 

sorship conjectures, for the Einstein—Vlasov system have been obtained 

only in the presence of an isometry group. The first global existence 

theorem, for small initial data, was proved by Rein and Rendall!’ in the !°Rein and Rendall (1992). 
case of spherical symmetry. A paper by Dafermos and Rendall?° proves 2°Dafermos and Rendall (2006). 
cosmic censorship in the case of surface-symmetric compactly supported 

initial data. References to other global results concerning the Einstein- 

Vlasov system can be found in that paper and in a review by Rendall.?! 21 Rendall (2005). 
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?2The Maxwell-Vlasov system in Spe- 
cial Relativity has been studied exten- 
sively. 


?3Recall that VaS*? = 0 and 
VaV er = 0 for any metric g and 
2-form F. 


X.7 The Maxwell—Einstein—Vlasov system 


Charged particles in a kinetic model generate by their motion an average 
electric current J, and hence an electromagnetic field, a 2-form F. 


X.7.1 Particles with given rest mass and charge 
If all particles have the same charge e and rest mass m, the current is 


J*(g) = me f POD Om p- (X.7.1) 


m,© 


The electromagnetic field satisfies on V the Maxwell equations 
dF=0, 6F=J, ie. VaF%® = SI. (X.7.2) 


The Maxwell—Einstein—Vlasov system?” is the coupled system of these 
Maxwell equations with the EM—GR-Vlasov equation together with the 
Einstein equations with source a stress—energy tensor that is the sum of 
the second moment of the distribution function f and the stress—energy 
tensor T of the electromagnetic field: 


Soe = TW = | f(x,p, e)p"p" 0p der, 
Pox R 
708 = FO, FOr — TYPE pE. 
The following theorem makes the system coherent:?% 
Theorem X.7.1 


1. If the distribution function f satisfies the EM-GR-Vlasov equation, 
then the current J has zero divergence. 


2. If, in addition, F satisfies the Maxwell equations, then the stress— 
energy tensor T is divergence-free. 


Proof. 1. The electric current is given by the product with m~'e of the 
first moment of f: 


i= mte f p“ f(x, p) Op. (X.7.3) 
Hence, by (5.6), 
Vad? = me f Lyx f(z, p)0p, (X.7.4) 
which implies, if Ly f(x, p) = Lx f(x, p) + ®°Of (x, p)/Op* = 


Va J = -m !e A L po en) bp. (X.7.5) 
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Using integration by parts and the equality 06° /Op* = 0 gives 


a OF (2, p) =- f OK an 
Í, b Ape op = ay f(x,p) Ope p = 0, (X.7.6) 


and hence 
Vad = 0. 


2. Analogous reasoning gives for the second moment when Ly f = 0 


o 
Vou = oe F869, (X.7.7) 
Px Op 
The calculus identity 
a O O xa Oo 
ð oF pp = — (d° fp*) — — fp? — DPF, (X.7.8) 
Op Op Op 
integration by parts, the property 0®°*/Op* = 0, and antisymmetry of 
F show that 
VaM”? = -f © fop = -e f Fy? pp. (X.7.9) 
Px Px 


We have seen in Chapter III that the divergence of the Maxwell tensor 
is, if F satisfies the Maxwell equations, 


Var = PREF = ef Faf pò fy. (X.7.10) 
Pe 


We have proved that 
Vo(MrF +7) =0 if Lyf =0. G 
The local existence theorem (Theorem X.6.1) extends to the Maxwell- 


Einstein- Vlasov system modulo hypotheses on the electromagnetic data 
(see the fluid case). 


X.7.2 Particles with random masses and charges 


If the particles have random masses and charges, then the electric current 
is given, at each point x € V, by 


J*(a) = I nim” e)p" 6, de. (X.7.11) 


See Problem X.14.2 for the local existence theorem in this case. 
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24This law assumes that there is no 
change in internal properties of the 
particles. 


25For a mathematical study of the 
Boltzmann equation in classical mech- 
anics, one may consult the review art- 
icle by Villani (2002). 


X.8 Boltzmann equation. Definitions 


When the particles undergo collisions, their trajectories in phase space 
are no longer connected integral curves of the vector field X: their mo- 
menta undergo jumps on crossing other trajectories. Consequently, the 
derivative of the distribution function f along X is no longer zero. In 
the Boltzmann model, this derivative is equal to the so-called collision 
operator Jf: 


Lxf =Tf. (X.8.1) 


Tf is a quadratically nonlinear integral operator that is being interpreted 
as being linked with the probability that two particles of momenta re- 
spectively p’ and q’ collide at x and give, after the shock, two particles, 
one with momentum p and the other with momentum q. One says that 
the shock is elastic when the following law of conservation of momentum 
holds:?4 


p+qd=pt+gq. (X.8.2) 


This equation defines a submanifold © in the fibre (xP,)*. For fixed 
p,q, one denotes by Up, the submanifold of (xP,,)? defined by (8.2). 
The volume element €’ (Leray form) in Upq is such that 


EA (AA + 4%) = Op A By (X.8.3) 


The collision operator is 
Ehar=f f Pered) - tetea] 
Px(q) pq 


x A(z, p, q, p, gE ^ Oq. 


The function A(x, p,q, p',q') is called the shock cross-section. It is a 
phenomenological quantity that depends on the physics of the shocks. No 
explicit expression is known for it in Relativity.” A generally admitted 
property is the reversibility of elastic shocks, namely 


A(x, p,q, p',q') = A(z, p,q’, p,q). (X.8.5) 


Lemma X.8.1 When the particles have all the same, non-zero, proper 
mass m, the integral on Mpg can be written by using a formula of the 
type, with 0, p canonical angular parameters on the sphere S°, 


A(x, p,q, p', qE = S(x, p,q, 0, p) sin 0 dé A dy. 


Proof. Introduce at the point x, for a given pair p,q of timelike vectors, 
an orthonormal Lorentz frame with time axis eg in the direction of p+ q. 
Set p +q = 2Xe0, 2 = [—g(p + q)]!/?. Then also p' +q’ = 2\eo, and 
since p,q, p',q' are timelike vectors with length m, 


(X.8.4) 


Pda = Pda = M — 20? < =m’. 


In such a frame, p° + q}? = p®+q° = 2A, while p? + ¢ = p’ + 
q = 0; hence (p’)? = (q*)?, and therefore p? = q}? = A, while 


X.9 Moments and conservation laws 


X(p')? = \? — m? =: a. The same properties hold for the primed 
variables; in this frame, Up, is represented by a2-sphere, U(p”)? = a? 
of radius a in the plane p° =  . It holds that 

1 


a= 3 (Pda E m?) 


2 _ 1 


59(P — 4p- q)". 


Take a vector parallel to p — q (which is orthogonal to p + q = 2AeQ) as 
axis for polar coordinates 0, p on Xpq, denoting by 0 the angle between 
the space vectors p — q and p’ — q'; the definition of €’ then gives 


E = (2A) ta sin 0 d0 A dy. 
The given relation holds with 
S(p,q,0, p) = A(z, p,q, p', q) (2A)+a, (X.8.6) 


where on the sphere Up,q both p’ and q’ are given by the 6-, y-dependent 
4-vectors 


1 1 
-{ s9(p +9)", =9(p — ap a)"/? (cos 8, sn 0s sin Osin g) Y 


(X.8.7) 
m 


X.9 Moments and conservation laws 


The moments of a distribution function f have been defined in Sec- 
tion X.2 by integrals over a fibre in phase space. The moment of order 
n is 


parinig) = f gean nh (X.9.1) 
Pz 
It satisfies the identity 


VON Pee en = | pez. sep Ly T Ep) (X.9.2) 
Pa 
The right-hand side is zero if f satisfies the Vlasov equation. 

We have interpreted the first and second moments as giving respect- 
ively the proper rest-mass-momentum vector and the stress-energy 
tensor of the macroscopic matter corresponding to the distribution 
function f. We prove a lemma, important for the coherence with the Ein- 
stein equations, when f satisfies a Boltzmann equation with reversible 
collisions: 


Lemma X.9.1 The first and second moments of a distribution f sat- 
isfying a Boltzmann equation on a spacetime (V,g) have zero divergence 
in the spacetime metric if the collisions are reversible. 
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26Bancel (1973) for the Boltzmann 
equation and Bancel and Choquet- 
Bruhat (1973) for the coupled system. 


Proof. The first moment of f is the vector field on V, interpreted as 
proper rest-mass—energy—momentum density, given by 


Pe = p“ f(x, p)Op- (X.9.3) 


© 


The equation (9.2) reads in this case 


VaP*= | Exfl | (ENED (X.9.4) 
Pe z 
The second moment of f, interpreted as the stress—energy tensor, is 
defined by 


TP =f pp? f(a, p) Op. (X.9.5) 


x 


For f satisfying a Boltzmann equation, we have 


Valz f p? (If) (x, p)wp. (X.9.6) 
Px 
Standard calculus, which we leave to the reader as an exercise, shows 
that, for reversible collisions, (9.4) and (9.6) have zero right-hand sides. 

In the case of particles with non-zero rest mass, we have set P® = ru“, 
with u the unit flow vector of the macroscopic matter corresponding to 
the distribution function f. Equation (9.4) is identical to the matter 
conservation law found for fluids: 


Val(ru’) =0. (X.9.7) 
Equation (9.6), 

Vat? =0, (X.9.8) 
is to be satisfied by all stress-energy tensors in General Relativity. W 


The higher moments do not satisfy conservation laws, but form an 
infinite hierarchy; see Section X.11. 


X.10 Einstein—Boltzmann system 


The Einstein equations with source the stress-energy tensor of a dis- 
tribution function f satisfying a Boltzmann equation form a coherent 
system if the collision operator is such that the stress-energy tensor is 
conservative. We have said that this property holds if the collisions are 
reversible. 

It has been proved?® that the local Cauchy problem, with initial data 
set (M,9,K, f), is well posed for such a Einstein—Boltzmann system. 

The same kinds of results hold for an Einstein-Maxwell—Boltzmann 
system obtained on replacing the GR-Vlasov operator by the EM- 
Vlasov operator. 


X.11 Thermodynamics 


One of the main reasons for the interest in relativistic kinetic theory is 
the possibility of obtaining the laws of thermodynamics in a relativistic 
context. The problem is already difficult in Special Relativity, and new 
challenges arise in General Relativity because of the non-existence of an 
equilibrium distribution function in a non-stationary universe. 


X.11.1 Entropy and the H theorem 


The entropy density of a one-particle distribution f was computed by 
Boltzmann to be in phase space the function?” —k(f log f)(x,p). The 
entropy flux in spacetime is the future-directed timelike vector?® 


A(x) := = f p°(f log f)(x, p) Op. (X.11.1) 


x 


The following theorem is the relativistic formulation of a theorem well 
known in non-relativistic thermodynamics: 


Theorem X.11.1 (H theorem) If collisions are reversible and satisfy 
the symmetry property (8.5), then the entropy flux H® is such that 


VaH® > 0. (X.11.2) 


Proof. The divergence of H is found to be, by a computation similar to 
a previous one, 


(VaH®)(«) = —k f rl bemani 
(X.11.3) 
=k Í, (Lx flog f + 1]) (0) 6p. 


x 


Using the Boltzmann equation and the reversibility of collisions, which 
implies that 


| | Hesi di-an ed Aerar 1) AG AD = 0, 
Px(q) Upq 


(X.11.4) 
we find that 


(VaR) =- | f Wep) - Feia] 
Px(q) J Epa 
x (log f)(x, p) A(x, p,q, p’, )E A Oq A Op. (X.11.5) 
Making, moreover, the natural assumption of symmetry, 


A(z, p,q, p,q) = A(z,@, 0,74), (X.11.6) 
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27Boltzmann’s constant k is the quo- 
tient of the perfect gas constant and 
Avogadro’s number, with the value 
1.3806568 JK~!. 


28From its physical definition, it holds 
that 0 < f <1; hence —f log f > 0. 
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2° Because (a — b)(loga — log b) > 0. 


we rewrite the above divergence as follows (the dependence on æ has 
been made implicit for brevity): 


VaH® =- ™ fons) - f(r) FQ) 


fo) | roel 
xog | A(p,q, p, g )E N Wg A Wy, (X.11.7 
C b 
which is non-negative? when A is non-negative. Oo 


The gas is said to be in thermal equilibrium if Va H®“ = 0. A general 
relativistic gas undergoing collisions will not in general attain in a finite 
time thermal equilibrium (see Section X.11.2). 

We can make a simple link between the microscopic and macroscopic 
properties of the gas when the distribution f is isotropic in phase space, 
namely when there exists a timelike vector v on spacetime such that f 
depends only on vap*. We have seen (Section X.2) that the macroscopic 
gas is then a perfect fluid with momentum flow P collinear with V. The 
same type of proof gives the following lemma: 


Lemma X.11.1 Assume there exists on spacetime a timelike vector V 
such that the function f is symmetric in the spaces orthogonal to V, i.e. 
in each fibre, 


f(z,q) = f(z,p) if Vap% = Vag“ and p= —4G, (X.11.8) 


where p and q denote respectively the projections of p and q on the sub- 
space orthogonal to V. Then the first moment P = rU as well as the 
entropy vector H are collinear with V. 


Lemma X.11.2 When the vectors H and P are collinear, one defines 
a positive specific scalar entropy S by setting 
PSP, (X.11.9) 
This specific entropy S satisfies on spacetime the inequality 
P°O,S > 0. (X.11.10) 
Proof. The lemma follows from inequality V,H® > 0 and the conser- 
vation law Va P® = 0. | 


When f does not have the property (11.8), the entropy and matter 
flux are not collinear. 

In all cases, integrating (11.2) on a spacelike slice Vr, with compact 
space or appropriate boundary conditions at spacelike infinity, we find 
that 


H? Nuz > | H°Nuz. (X.11.11) 
Mr Mo 
Exercise X.11.1 Prove this inequality. 


The inequality (11.11) leads some cosmologists to think that it is the 
expansion of the universe that permits its ever-increasing organization 
from an initial anisotropy of f. 


X.11.2 Maxwell—Jiittner equilibrium distribution 


A gas is considered to be in thermal equilibrium if its entropy is 
conserved in the sense that 


Va H° =0. (X.11.12) 


Equation (11.3) shows that a sufficient condition for this equality to hold 
is that the distribution function f be conserved along the trajectories 
of X in phase space, i.e. Lx f = 0. This condition is, in a Boltzmann 
framework, 


T(f) =0. (X.11.13) 


A sufficient condition for this equality to hold is that at each point x of 
spacetime, the function f is such that 


FDF) -— Ffl) =0, if p+q=p td. (X.11.14) 


This condition is also necessary if A(p, q, p', q’) is strictly positive. It can 
also be proved to hold under weaker assumptions.®° It can immediately 
be checked that a solution of the above functional equation is 


f(x, p) = a(x) exp[ba(x)p", (X.11.15) 


with a a positive scalar function and b a covariant vector on spacetime. It 
can be proved that all continuous solutions of (11.14) are of this form. If 
a is non-negative, the same is true of f; if, moreover, b is future timelike, 
then ba(a)p* < 0 and f(#,.) is integrable on P,. Such functions are 
called Maxwell—Jiittner distributions. 

For such a Maxwell—Jiittner distribution f, we find by straightforward 
computation that 


ða 1 
+ <p %p?(Vabg + Vaba)| - (X.11.16) 


=< ba pò a 
fat =e P Ore * 2 


A function f given by (11.15) will satisfy Lx f = 0 on phase space if and 
only if on spacetime it holds that 0a/Ox° = 0 and Vabs + Vgba = 0, 
that is, iff a is a constant and b is a Killing vector field: the spacetime 
must therefore be stationary to admit a thermal equilibrium distribution 
function—an unsurprising result. 

We deduce from Theorem X.2.1 that the rest-mass-momentum vector 
and the stress-energy tensor on spacetime associated with a Maxwell- 
Juttner distribution with a timelike vector b are those of a perfect fluid 
with flow vector collinear to b. The same kind of proof shows that the 
entropy vector H is collinear to b, and hence to u. The integrals giv- 
ing the moments can be expressed through modified Bessel functions of 
the second kind and index n.3! The scalar (—b%b,)2 is interpreted as 
the inverse of the product of the absolute temperature by Boltzmann’s 
constant: 


(—b%ba)? = (kT)7?. (X.11.17) 
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30Marle (1969), p. 94. 


31See Pichon (1967) and Marle (1969). 
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32See, for a summary, YCB-OUP 2009, 
Chapter X, Section 11 (contributed by 
Tommaso Ruggeri) and, for a complete 
exposition, Müller and Ruggeri (1998). 


Remark X.11.1 Jf the particles have zero mass (e.g. photons), one 
can satisfy the condition Lx f = 0 only by imposing that b is a conformal 
Killing vector field, i.e. Vabg + Vaba = Agap. This is the reason why 
Robertson and Walker used the Planck distribution for photons in their 
expanding spacetimes. 


X.11.3 Dissipative fluids 


In classical mechanics, the Navier-Stokes equations can be obtained from 
a distribution function that is a first-order perturbation of the Max- 
well equilibrium distribution, either by the Chapman—Enskog method 
or by Grad’s polynomial expansion method. Both methods have been 
extended by Marle, who found the stress-energy tensor corresponding 
to a first-order perturbation of the Maxwell—Jtittner distribution. The 
corresponding, very complex, system of equations have been shown to be 
of hyperbolic type only in some simplified cases. The fact that thermal 
equilibrium is not compatible with non-stationary spacetimes limits the 
validity of such equations in General Relativity. 


X.12 Extended thermodynamics 


The objective of extended thermodynamics’? is to obtain equations for 
fluids in General Relativity that respect fundamental physical laws and 
also the relativistic causality principle. It uses for general relativistic 
fluids, in addition to the usual first moment P®(a) and second moment 
T°? (x), higher moments and relations between them that are physically 
and mathematically meaningful. 


X.13 Solutions of selected exercises 
Exercise X.3.1 


Let x% be another coordinate system in V. Set 


, Ox” 
A® := : 
s Ox 


The corresponding change of components of a vector in Ty V is 


Q 


1 Ox” 
p% = 


7 Age? T Aa P“. 


Therefore, the change of coordinates (x, p*) > (x, pe) in TV implies 


3p” ya! Op” = a ee 


X.13 Solutions of selected exercises 
Hence, for the components (p, G°’) of X in the new coordinates on TV, 


=A% p”, G” = pf ðg( AS p%) — AX TS pòp". 


We know that (see Chapter I) 


a ya’ a a’ AB a 
Tory = Aa Op AS, + AQ AG An T 3y. (X.13.2) 
Some straightforward manipulations shows that 
G* = Thy p? p’. (X.13.3) 
Exercise X.3.2 
ar pIu Ppa PAH 
Lxg(p, p) = pp p" a T 29apP°T Aup P 
a Og 
= p*p*p" ae = Japp’ g% Au, v|p*p" 
= Iau — opa Apu [Du _ IDa) — 
= [A = 2p%p*p" =0. 
prom (Be — [Au al) pp (Se ie.) 


Exercise X.3.3 
Jagp“p? = constant implies 


Op” Op? . Op? 
2goop” api + IP ax T 2goip + 2gijp = 2po apt + 2Pi = 05 


hence 


dp? o Pi 
Op" Po 


We have fm(x,p) := f(x, p°, p’), with gap pòp” = —m?, and hence 


Ofm _ Of pp" gau Of ena Ofm _ Of _ pi OF 
ðr% xz%  2pọ Ox Ap? Op' p — po Op?” 


from which the result follows. 


Exercise X.6.1 
For an arbitrary function such that $(t) = ¢(—t), it holds that 


M sota f ootas [T o(t)t dt = 0, 
[. ee A p(t )t at! = — | T AEE atl 


because 
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X.14 Problems 


X.14.1 Liouville’s theorem and generalization 


We have shown in Problem [.14.1 in Chapter I that the volume form 0 of 
the tangent bundle TV of a Lorentzian spacetime is invariant under the 
geodesic flow; that is, with Lx denoting the Lie derivative with respect 
to the vector X = (p*,G*%), it holds that Ly 6 = 0. 

Using the identity Ly = d(iy 0)+iyd6, show that the volume form 6 is 
invariant under the flow of a vector Y = (p°, G% + ®*) if ® is orthogonal 
to p and has a vanishing divergence. 


Solution 
The volume form on TV is 
6 := |Det gldx° A... A d£” A dp? A... A dp”. 


Recall that Ly @ = d(iy 6) +iyd@. Since the volume form is of maximum 
degree on TV, we have d0 = 0, and hence Ly @ = diy@. The linearity of 
the interior product gives 


iy = ix + i940. 
We already know that d(ix0) = 0. We set 6 = @, + 0p; then 
i(0,0)9 = x A ie %p, (X.14.1) 
with 
(is9p)an..om = B°(p)oxer om 


and hence, because of antisymmetry, 


. Ob? Ob 
d(i Ap) = Opa? =0 if Ope =0. 
When ®* = F% gp’, it holds that 
Ob? 
=f", = 0, 
Op 


X.14.2 Vlasov equation for particles with random 
charges 

Assume that the considered ‘particles’ have, like momenta, random elec- 

tric charges. The phase space Fy is the bundle over V whose fibre Fẹ V 


is the product P,V x I, with J an interval of R. The volume form in Fy 
is the product, with 8 given by (2.2), 


62:=0 Ade: 


The particles create an average electromagnetic field F and are subjected 
to the Lorentz force 


BY = FP jg, 


where jg is the electric current associated to a particle of momentum p 
and charge e located at x, i.e. with u denoting its unit velocity, 


: -i 
ja = eug =elg(p,p| 2p”. 
We denote by Ye the vector field on TV x R with components 
p“, G% + 6%, 1. 


The volume form 6, is clearly invariant under the flow of Ye, as 0 was 
under the flow of Y; that is, Ly@. = 0. Particle number conservation 
gives for the distribution function f the EM—GR-Vlasov equation 
with random charges: 


Ofe P Ofe 


Lyf =p“ Dp? De 


0 fe a oa 
sae Supp" + F% gj?) =0.  (X.14.2) 


X.14.3 Distribution function on a Robertson—Walker 
spacetime with Vlasov source 


Consider a spacetime M x R with a Robertson—Walker metric. 
g=—dt?+R(t)o?, with o° = y;dr'dz’. 


g is a given Riemannian metric on M. 
1. Derive the Einstein—Vlasov equation for a general distribution func- 
tion f. 


2. Look for a solution fm depending only on t and p° for particles of a 
given rest mass m. Show that it satisfies the equation 


Ofm 0\2 2 1 Ofm = 
R aR [(p =m ] z0 apo =0. 


3. Show that fm is an arbitrary function of the scalar R? [(p°)? — m?]. 


4. Suppose that fm vanishes at time to for particles with momentum 
such that 


(p°)? > m? +kR-7(to), k some constant. 


Show that the maximum possible energy p° of particles with a given 
rest mass decreases with expansion, as foreseen physically. 


X.14 


Problems 
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Solution 
1. Straightforward computation of G% = -T$ Pòp“ gives 
of ; j OF = , OF 
Lxf =p? — RR'y,;,;p'p) — —2R1R'pp'—. 
xf =p aa VPP BA PP opi 


For particles of a given rest mass m, i.e. such that 
Rijp p = (PY — m?, 
the equation for a distribution fm depending only on t and p? reduces 
to 
po îfm Ofm _ 
Ot Op? 
Taking R instead of t as a variable, the equation reads as the following 
linear first-order partial differential equation: 
o 10 
fim [(p°)? m?| di 2 
OR p? Op 
The general solution is constant along the rays (bicharacteristics) that 
satisfy the differential system 
dR 0 dp? 
=- P =d. 
R (p°)2 = m? 


These rays are such that 


-RR [(p?)? z m?] 


R 


1 
log R+5 log [(p°)? — m?] = constant, i.e. R? [(p°)? — m?] = constant. 


The distribution fm is therefore an arbitrary function of the scalar 
R? |(p°)? — m?]. Suppose, for instance, that fm vanishes at time to for 
particles with momentum such that 


(p°)? > m? + kR? (to). 
Then the function fm vanishes at time t for particles with momentum 
(p°)? > m? + kR-* (2). 


Hence the maximum possible energy p° of particles with a given rest 
mass decreases with t if R increases. 
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